Preface


This booklet is intended for high-school students interested
in mathematics. It is concerned with approximating real numbers
by rational ones, which is one of the most captivating
topics in arithmetic.

In the last decade, some young mathematicians, and not 
only young mathematicians, have shown a negligent attitude 
towards “classical” and “pure” mathematics in contrast with 
“modern” and “applied” mathematics. This stance is fully 
unjustified. 

First, mathematics rests on a foundation of numerous classical
theories, facts and findings which must be known to every
mathematician. For instance, the theory of continued fractions,
a part of classical pure mathematics, is widely used nowadays
to calculate numerical values of functions by means of computers.

Second, as science develops, many of its theories become
obsolete and “dry up”, like some branches of a tree. Quite a
few do, yes, but not all of them. There are theories which
have survived centuries (or even millennia) and still retain their
significance.

Continued fractions represent one of the most perfect creations
of 17-18th century mathematicians: Huygens, Euler,
Lagrange, and Legendre. The properties of these fractions
are really striking.

The following should be borne in mind when reading this 
booklet. 

Topics easily understandable are presented in normal print, 
while those more difficult are given in small print. Proofs of 
some theorems given in small print may be omitted safely. 
These theorems will necessarily be taken for granted. 

However, mathematics is not just reading for entertainment.
A future mathematician, as well as a physicist or an
engineer, has to acquire skill in dealing with mathematical
constructions and proofs. So take a pencil and a sheet of paper
and study carefully the topics given in small print. You
may succeed in simplifying some proofs or finding better
ones.

The theory of continued fractions is vast. This booklet
covers only its fundamentals, but it contains everything that
may be useful for a layman interested in mathematics.
Professional mathematicians need to know much more.

Nikolai Beskin 



Chapter 1 

Two Historical Puzzles 


1.1. Archimedes’ Puzzle 

1.1.1. Archimedes’ Number. Many people believe that only
a distant journey, preferably to outer space or the ocean
bottom, could enable them to meet anything extraordinary,
for everyday life is so familiar that it can reveal no unusual
facets.

What a delusion it is! Our surroundings are full of puzzles 
which go unnoticed because they seem to be habitual. 

This chapter tells us a story of two enigmatic, yet familiar, 
episodes from the history of mathematics. 

High-school students the world over know from the course
in geometry the symbol π, which denotes the ratio of the
circumference of a circle to its diameter.

The letter π is the first letter of the Greek word
περιφέρεια which means “circle”. The English mathematician
Jones was the first to introduce the symbol π, in 1706. In
1736 Euler adopted this notation instead of the symbol p he
had previously used. Since then the symbol π has come into
general use.

From the most ancient times mathematicians sought a
value for the number π. Archimedes determined its approximate
value as 22/7*. This fact is so well known that hardly
anybody suspects that it conceals a mystery. Who ever asks


* Actually Archimedes gave a different formulation to this result
in his book On the Measurement of the Circle. He determined for π
its bounds: $3\frac{10}{71} < \pi < 3\frac{1}{7}$. To quote Archimedes,
“The circumference of any circle equals three times the diameter
plus an excess which is less than one seventh of the diameter but
greater than 10/71 of it.” Although the value of π is closer to
$3\frac{10}{71}$ than to $3\frac{1}{7}$, the simpler value
$3\frac{1}{7}$ is the one in general use.



why Archimedes chose a fraction with 7 for denominator?
What would happen if π were approximated by a fraction
with denominator 8?

This question proves to be of extreme interest. 

1.1.2. Approximation. Mathematicians often encounter the
problem of replacing an object (a number, a function, a figure,
etc.) by some other object of the same nature, which is in
some sense sufficiently close to, but simpler than, the original
one. This replacement is called approximation. In the general
case it requires that a set of objects be singled out and the
sense of the phrase “sufficiently close to” be defined. We shall
not discuss this general problem and restrict ourselves to the
approximation of real numbers.

[Fig. 1. The numbers a and 3 + a on the number line]

Let us consider the set of all real numbers. The conventional
notation for this set is R. Real numbers may be of complicated
nature, e.g. irrational numbers, or be cumbersome,
e.g. fractions with large denominators.

It is worth explaining why the cumbersomeness of a fraction is
evaluated by its denominator. (Recall that a fraction is
a number p/q where p and q are integers and q ≠ 0; therefore,
a quotient whose numerator or denominator is not an integer
is not a fraction.) If we are mainly interested not
in the magnitude of a real number a but in its arithmetic
nature, we need to know the position of a between two consecutive
integers n and n + 1. The addition of an integer to
the number a will not change the arithmetic nature of a
(this statement does not hold for that branch of arithmetic
which deals with integers). Figure 1 shows two numbers a
and 3 + a identically located within segments [0, 1] and
[3, 4] (the term “segment” is defined on p. 41). For instance,
the numbers $\frac{391}{4} = 97\frac{3}{4}$ and $\frac{3}{4}$ are
identically located within
the corresponding segments [97, 98] and [0, 1], and thus there
are no reasons to regard the former as being more complicated
than the latter. This implies that an analysis of the nature of
the numbers within the segment [0, 1] would be quite sufficient
since the same pattern is reproduced within each segment
[n, n + 1]. This is why we are concerned only with the
denominator when evaluating the cumbersomeness of a fraction.

Let us single out a subset of fractions with a given denominator
q from the set R of all real numbers. The distance between
a number a and a fraction p/q is $\left| a - \frac{p}{q} \right|$. Now we can
give an interpretation of the problem of the approximation
of real numbers as follows: to approximate a real number a by
a fraction with denominator q which is the closest to a among
all fractions with denominator q.

If we mark all fractions with denominator q on the number
line, the number a will fall within an interval between
two fractions or coincide with one of them. The latter case is
trivial, and we can write that

$$ \frac{p-1}{q} < a < \frac{p}{q}. $$

Of these two fractions the one nearest to a is chosen as its
approximation (Fig. 2).

[Fig. 2. The number a between the neighbouring fractions (p − 1)/q and p/q]

It could happen that a is the middle point of the segment
$\left[ \frac{p-1}{q}, \frac{p}{q} \right]$. In this case, and only in this case,
the problem has two solutions. For the sake of definiteness, we
choose to adopt the left-end point of the segment as the
approximation of a.

It is clear, therefore, that a fraction with any denominator
can approximate the number a, that is, the choice of the
denominator q is a matter of preference.

Approximation is employed when it is necessary to use a
rational number instead of an irrational number. It is also
applicable to replacement of rational numbers by less cumbersome
ones, i.e. by numbers with smaller denominators. For
instance, the approximation of the number 2936/7043 by the
fraction with denominator 12 is 5/12 since

$$ \frac{5}{12} < \frac{2936}{7043} < \frac{6}{12}, $$

where 2936/7043 is nearer to 5/12 than to 6/12.
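The choice of the nearest fraction with a prescribed denominator can be sketched in a few lines of Python. This is only an illustration: the function name nearest_fraction is hypothetical, exact rational arithmetic is assumed for the input, and ties are resolved in favour of the left-end point as agreed above.

```python
from fractions import Fraction
import math


def nearest_fraction(a, q):
    """Return the fraction with denominator q closest to a.

    When a is exactly midway between two neighbouring fractions,
    the left end point is returned, following the convention in the text.
    """
    a = Fraction(a)
    p_left = math.floor(a * q)        # a lies in [p_left/q, (p_left + 1)/q)
    left = Fraction(p_left, q)
    right = Fraction(p_left + 1, q)
    # The strict inequality keeps the left end point in the case of a tie.
    return right if (right - a) < (a - left) else left


print(nearest_fraction(Fraction(2936, 7043), 12))   # 5/12
```

Note that Fraction reduces its result to lowest terms, so an approximation such as 6/12 would be printed as 1/2.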

The approximation of real numbers by decimal fractions has
long been in general use. However, decimals were not yet known
in Archimedes’ time*, and he could choose fractions with
arbitrary denominators to approximate the number π. Why
did he prefer fractions with denominator 7? Could it be
purely accidental?


1.1.3. Error of Approximation. A real number a is approximated
by a fraction P/q with an error

$$ \Delta = a - \frac{P}{q}, $$

where P/q stands for the end point of the segment
$\left[ \frac{p-1}{q}, \frac{p}{q} \right]$ which is the closest to a.

The error is thus the exact value of a minus its approximation.
Therefore, the error is positive if $\frac{P}{q} = \frac{p-1}{q}$, and
negative if $\frac{P}{q} = \frac{p}{q}$.

The absolute value |Δ| of the error is called the absolute
error.

It is clear that the absolute error does not exceed 1/(2q) (see
Fig. 2):

$$ |\Delta| \le \frac{1}{2q}. $$

The number 1/(2q) is the upper bound of absolute error. The
upper bound depends on the choice of approximation. For


* Decimal fractions became known in Europe at the end of the 
16th century, although in the Orient they had been used since the 
end of the 15th century. They were invented by the Flemish scientist 
Simon Stevin. Here is what the English writer Jerome K. Jerome had 
to say on the matter: “From Gent we went to Bruges (where I had 
the satisfaction of throwing a stone at the statue of Simon Stevin, 
who added to the miseries of my school-days by inventing decimals), 
and from Bruges we came on here.” (Diary of a Pilgrimage , the entry 
for Monday, June 9.) 





instance, if we agreed to approximate the number a by the
left-end point of the segment $\left[ \frac{p-1}{q}, \frac{p}{q} \right]$, then the upper
bound would be 1/q.

1.1.4. Quality of Approximation. The absolute error reaches
the upper bound if a is the middle point of the segment
$\left[ \frac{p-1}{q}, \frac{p}{q} \right]$. This is the most unfavourable case. If, however,
a is very close to one of the end points, the actual absolute error
may be considerably smaller than the upper bound.

This observation suggests that the evaluation of the
quality of approximation is required. It is clear that the
approximation of a number a by a fraction with a small
denominator is appropriate if the error
is small; or, to be more precise, if the absolute error is
substantially less than the upper bound of the error (Fig. 3).

[Fig. 3. Quality of approximation (panel label: low quality)]

In order to evaluate the quality of approximation, we have
to estimate the ratio of the actual absolute error to the upper
bound on the absolute error:

$$ \frac{\text{absolute error}}{\text{upper bound on absolute error}} = \frac{|a - p/q|}{1/(2q)} = 2\,|qa - p|. $$

It is convenient to consider one half of this ratio denoted
by h and called the normalized error,

$$ h = |qa - p|. \tag{1} $$

The normalized error h is thus one half of the ratio of the
actual absolute error to the maximum possible error. It is
obvious that

$$ 0 < h \le \frac{1}{2}. $$

The smaller h is, the higher the quality of approximation.
We call the quantity

$$ \lambda = \frac{1}{2h} = \frac{1}{2\,|qa - p|} $$

the quality factor. It has a simple and lucid meaning: The
quality factor of approximation is the factor by which the actual
absolute error is less than the maximum possible error. It is
obvious that

$$ 1 \le \lambda < \infty, $$

and the greater λ, the better the approximation.

It would be wrong to expect fractions with greater denominators
to be more useful. It could happen that the approximation
of the number a by a fraction with denominator 8
is less accurate than that by a fraction with denominator 7.
Let us have a look at the number π, approximated by fractions
with denominators from 1 through 10 (see Table 1). We omit
the calculations, leaving them to the reader.


Table 1

 q    Approximate     Upper bound on       |Δ|        h         λ
      value of π      absolute error

 1       3/1          1/2  = 0.5000      0.1416    0.1416      3.5
 2       6/2          1/4  = 0.2500      0.1416    0.2832      1.8
 3       9/3          1/6  = 0.1667      0.1416    0.4248      1.2
 4      13/4          1/8  = 0.1250      0.1084    0.4336      1.2
 5      16/5          1/10 = 0.1000      0.0584    0.2920      1.7
 6      19/6          1/12 = 0.0833      0.0251    0.1504      3.3
 7      22/7          1/14 = 0.0714      0.0013    0.0089     56.5 (!)
 8      25/8          1/16 = 0.0625      0.0166    0.1327      3.8
 9      28/9          1/18 = 0.0556      0.0305    0.2743      1.8
10      31/10         1/20 = 0.0500      0.0416    0.4159      1.2


This table demonstrates that the approximation of π by
fractions with denominator 7 is more accurate than that by
the other fractions. The actual error is less than its upper
bound by a factor of 56.5.
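The calculations behind Table 1 can be checked with a short Python sketch. It is only an illustration: the function name approximation_quality is hypothetical, and π is taken as the floating-point constant math.pi, which is accurate enough for four decimal places.

```python
import math


def approximation_quality(a, q):
    """Return (p, abs_error, h, lam) for the best fraction p/q to a."""
    p = math.floor(a * q)
    # Choose the nearer end point of [p/q, (p+1)/q]; ties go to the left.
    if (p + 1) / q - a < a - p / q:
        p += 1
    abs_error = abs(a - p / q)
    h = abs(q * a - p)                       # normalized error, formula (1)
    lam = 1 / (2 * h) if h else math.inf     # quality factor
    return p, abs_error, h, lam


for q in range(1, 11):
    p, err, h, lam = approximation_quality(math.pi, q)
    print(f"{q:2d}  {p:2d}/{q:<2d}  {1 / (2 * q):.4f}  {err:.4f}  {h:.4f}  {lam:5.1f}")
```

The same function can be applied to any other denominator, for example 113, to examine how exceptional a given approximation is.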

Figure 4 shows the location of π on the number line. Accidentally
(but is it indeed accidental?) π happens to be quite
close to $3\frac{1}{7}$. If it were prescribed to approximate π with the
absolute error less than or equal to 0.0013, how would we
proceed? We would write down the condition

$$ \frac{1}{2q} \le 0.0013, $$

whence q ≥ 385. Archimedes had achieved the same accuracy
using a much smaller denominator. It is worth mentioning
here that fractions with denominator 385 make it possible to
approximate any real number with an error less than 0.0013,
while fractions with denominator 7 are preferable for
approximating the number π.

[Fig. 4. The location of π on the number line]

Archimedes’ choice could not therefore be accidental. But
how did he make that choice?

Many centuries later (in 1585) a Dutch scientist from Metz,
Adriaen Antoniszoon (also known as Adriaen Antonisz),
found the approximate value 355/113 for π.


This result was published after Antoniszoon’s death
by his son Adriaen Metius, so that the value 355/113 is
traditionally called Metius’ number. Metius’ number has the
same striking property as Archimedes’ number: the actual
error is less than could be expected for the denominator 113.
We invite the reader to examine Metius’ number in the same
way as Archimedes’ number has been analysed.

There is no doubt that Metius’ number was not an accidental
discovery. In fact, it was known long before Adriaen Antoniszoon
happened to find it (see, e.g. Struik’s book in the
Bibliography).


1.2. The Puzzle of Pope Gregory XIII 

1.2.1. The Mathematical Problem of the Calendar. Pope
Gregory XIII was not a mathematician but his name is associated
with an important mathematical problem, that of the
calendar.

Nature has supplied us with two obvious time units: the
year and the day (solar day). We even read in one old textbook
on cosmography: “Unfortunately, the year does not comprise
an integral number of days.” We cannot but agree
with this complaint because the fact does bring a lot of
inconvenience. However, it also generates an interesting
mathematical problem.

1 year = 365 days 5 hours 48 minutes 46 seconds 
= 365.242199 days*. 

It would be impossible to enact and implement this duration
of the year in civil life. But what if the civil year is
declared exactly 365 days long? Figure 5 shows the orbit of
the Earth. On January 1, 1985, at midnight, the Earth was
at point A. On January 1, 1986, at midnight, it will be at
point B, and next January 1 it will be at point C, and so
forth. As a result, if we mark on the orbit the position of the
Earth corresponding to a fixed date, this position will not
be the same each year but will lag by nearly six hours.


* Neither the astronomical aspects of the calendar (such as
variation in the length of the year) nor its history are analysed here in
detail; we concentrate only on one mathematical problem connected
with the calendar. We recommend that the reader interested in these
details look them up elsewhere.
The lag will build up to almost one day in four years, so that
the fixed date will gradually slip among the seasons, that is,
January 1 will move from winter first to autumn, then to summer.
This would be inconvenient because periodic events such
as crop sowing or the beginning of the school year could not be
tied to fixed calendar dates.

We know how to remedy the situation. Some years must be
decreed to have 365 days, and some 366, in order to have the
average duration of the year as close to the true duration as
possible. This approach can approximate the true duration
with any prescribed precision, but the required rule of alternation
of shorter (ordinary) and longer (leap) years may be
undesirably complicated. We need a compromise: a relatively
simple pattern of alternation of the ordinary and leap years
which brings the average length of the year sufficiently close
to the true value.

1.2.2. Julian and Gregorian Calendars. This problem was
first solved by Julius Caesar. Or rather, by the Alexandrian
astronomer Sosigenes, who was summoned to Rome for this
purpose. The system introduced by Julius Caesar
was as follows: three successive shorter (ordinary) years followed
by one longer (leap) year. Much later, when the Christian
chronology was introduced, it was decided to have
leap years when the number of the year was an integral multiple
of 4.

This calendar is called the Julian calendar. The average
length of the year in the Julian calendar is $365\frac{1}{4}$ days =
365 days 6 hours, or 11 minutes 14 seconds longer than the
true length.

The Julian calendar was improved by Pope Gregory XIII.
In fact, calendar reforms had been proposed and elaborated
long before but were never implemented. In 1582 Gregory XIII
enacted the calendar reform. The alternation of ordinary and leap
years was retained, with an additional rule: If the number
of the year ends with two zeros but the number of hundreds is not
an integer multiple of 4, the year is treated as ordinary. For
example, this rule classifies the year 1700 as an ordinary year and
the year 1600 as a leap year. Furthermore, assuming that the
error accumulated “since year 1 A.D.” was 10 days, Pope
Gregory XIII ordered that 10 days be added to the current date,
namely, that the day following Thursday, October 4, 1582,
be considered Friday, October 15. Three more days have
accumulated since then (in 1700, 1800, and 1900). Consequently,
at this moment the discrepancy between the Julian
and Gregorian calendars is 13 days.

What is the average length of the Gregorian year? Of 400
years of the Julian calendar, 100 are leap years, while four
Gregorian centuries contain only 97. Hence, the average Gregorian
year has $365\frac{97}{400}$ days = 365.242500 days = 365 days
5 hours 49 minutes 12 seconds, or 26 seconds longer than the
true length of the year.
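The two average year lengths are easy to verify with exact arithmetic. The sketch below is only an illustration; the function names are hypothetical, and the true length of the year is taken to be the figure quoted above, 365 days 5 hours 48 minutes 46 seconds.

```python
from fractions import Fraction

SECONDS_PER_DAY = 24 * 60 * 60
# True length of the year as quoted in the text: 365 d 5 h 48 min 46 s.
TRUE_YEAR = 365 + Fraction(5 * 3600 + 48 * 60 + 46, SECONDS_PER_DAY)


def is_leap_julian(year):
    return year % 4 == 0


def is_leap_gregorian(year):
    # Century years are leap only if the number of hundreds is a multiple of 4.
    if year % 100 == 0:
        return (year // 100) % 4 == 0
    return year % 4 == 0


def average_year(is_leap, cycle=400):
    """Average length in days over one full cycle of the leap-year rule."""
    leap_years = sum(is_leap(y) for y in range(1, cycle + 1))
    return 365 + Fraction(leap_years, cycle)


for name, rule in [("Julian", is_leap_julian), ("Gregorian", is_leap_gregorian)]:
    excess = (average_year(rule) - TRUE_YEAR) * SECONDS_PER_DAY
    print(name, average_year(rule), "days; excess", excess, "seconds per year")
```

The excess comes out as 674 seconds (11 minutes 14 seconds) for the Julian rule and 26 seconds for the Gregorian rule, in agreement with the figures given in the text.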

We see that very high precision has been achieved by quite 
simple means. How could this result be achieved? 

This question will be answered in Chapter VI. 



Chapter 2 

Formation of Continued Fractions 


2.1. Expansion of a Real Number 
into a Continued Fraction 

2.1.1. Algorithm of Expansion into a Continued Fraction. 

Let us forget for a time the decimal number system. The
brilliant Soviet mathematician Nikolai Luzin (1883-1950)
used to say in his lectures that “the advantages of the decimal
system are zoological, not mathematical. If we had eight
fingers on our two hands instead of ten, mankind would operate
in the octal system.” The decimal system is indeed very convenient
in practice but it is inappropriate when theoretical
aspects of arithmetic are discussed.

We thus forgo the decimal and any other positional number
system, that is, we take Archimedes’ place and ask ourselves:
What would be the most natural approach to estimating a
real number?

This question is answered without hesitation: the first step
is to indicate the integers between which our number lies.
For example,

    61/27 lies between 2 and 3,
    $\sqrt{2}$ lies between 1 and 2,
    π lies between 3 and 4.

Of course, it is sufficient to indicate only the lower of these
bounds:

$$ \frac{61}{27} = 2 + x \quad (0 < x < 1), $$

$$ \sqrt{2} = 1 + y \quad (0 < y < 1), $$

$$ \pi = 3 + z \quad (0 < z < 1). $$

Note that this estimation is not bound to any specific notation
of integers, that is, to a specific number system.

Let us continue with the number 61/27. Our estimate “two plus
something” is too rough and constitutes only a first approximation.
If we want to make the second step, we have to estimate
the “makeweight” x. Since x is less than 1, it is logical
to represent it by a fraction with numerator 1 (we again appeal
to “acting natural”, but we do it for the last time):

$$ \frac{61}{27} = 2 + \frac{1}{x_1}. $$

Now $x_1$ is greater than unity, and we repeat the familiar
steps: we single out the integral part of the number, and so
forth. The reader is invited to follow attentively this alternation
of steps:

$$ \frac{61}{27} = 2 + \frac{7}{27} = 2 + \cfrac{1}{\frac{27}{7}} = 2 + \cfrac{1}{3 + \frac{6}{7}} = 2 + \cfrac{1}{3 + \cfrac{1}{\frac{7}{6}}} = 2 + \cfrac{1}{3 + \cfrac{1}{1 + \cfrac{1}{6}}}. $$

The expression

$$ a_0 + \cfrac{1}{a_1 + \cfrac{1}{a_2 + \cdots + \cfrac{1}{a_s}}} $$

where $a_1, a_2, \ldots, a_s$ are natural numbers* and $a_0$ is a natural
number or nought, is called the continued fraction.

The numbers $a_0, a_1, a_2, \ldots, a_s$ are called the terms of the
continued fraction. We can say that we have expanded the
fraction 61/27 into a continued fraction.

In what follows we shall often use this algorithm. It consists
of two alternating steps.

Step 1. Single out the integral component of the number,
that is, write it as the sum of an integer and a remainder less
than unity.

Step 2. Represent the remainder as unity divided by a
number greater than unity. Apply Step 1 to this denominator,
and so on.
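The two alternating steps translate almost literally into code. The following Python sketch is only an illustration: the function name expand and the safety limit max_terms are assumptions, and the input is restricted to rational numbers so that the arithmetic stays exact.

```python
from fractions import Fraction
import math


def expand(x, max_terms=50):
    """Expand a rational number x into a continued fraction.

    Step 1: split off the integral part.
    Step 2: replace the remainder by unity divided by a number > 1.
    """
    x = Fraction(x)
    terms = []
    for _ in range(max_terms):
        a = math.floor(x)        # Step 1: integral part
        terms.append(a)
        remainder = x - a        # 0 <= remainder < 1
        if remainder == 0:       # nothing left to expand
            break
        x = 1 / remainder        # Step 2: reciprocal, greater than unity
    return terms


print(expand(Fraction(61, 27)))   # [2, 3, 1, 6]
```

For a rational input the loop always stops by itself; the limit max_terms is only a guard.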

But before we go deeper into the theory of continued fractions,
let us answer three questions.

* Recall that the natural numbers are 1, 2, 3, ... . Nought is
not included in the set of natural numbers.
2.1.2. Notation for Continued Fractions. Question One.
Is not the notation for continued fractions too cumbersome?
Our first example resulted in a three-storey fraction; for a
twenty-storey fraction, the page space will not be enough.

This is why various notations have been devised for continued
fractions. We shall use the following system:

$$ a_0 + \cfrac{1}{a_1 + \cfrac{1}{a_2 + \cdots + \cfrac{1}{a_s}}} = [a_0; a_1, a_2, \ldots, a_s]. $$

The semicolon emphasizes the special role of the integral
part $a_0$, distinct from that of the other terms. Its role is
special but not more important; in this particular case it is
rather less important.

2.1.3. Expansion of Negative Numbers into Continued Fractions.
Question Two. How to expand a negative number
into a continued fraction?

Two approaches can be used to expand a negative number
into a continued fraction.

1. Place the minus sign in front of the fraction as a whole,
for example,

$$ -\frac{61}{27} = -[2; 3, 1, 6]. $$

2. Allow negative values of $a_0$, keeping $a_1, a_2, \ldots, a_s$
positive at all times. For example,

$$ -\frac{61}{27} = -3 + \frac{20}{27} = -3 + \cfrac{1}{1 + \cfrac{1}{2 + \cfrac{1}{1 + \cfrac{1}{6}}}} = [-3; 1, 2, 1, 6]. $$

In this book we shall use only the second approach. Hence,
from now on $a_0$ is an arbitrary integer while $a_1, a_2, \ldots$ are
natural numbers.

Having made this remark, we shall not pay much attention
to negative numbers when outlining the theory. A negative
number can be obtained by adding a certain negative integer
to a positive number. In order to examine the arithmetical
nature of the number −61/27, we can study the number 20/27 and
then add −3 to it.

2.1.4. Examples of Nonterminating Expansion. Question
Three. Will the process of expanding a number a into a continued
fraction inevitably terminate?

No, it may prove to be infinite. Let us have a look at some
examples.

Example 1. Expand $\sqrt{2}$ into a continued fraction.

$$ \sqrt{2} = 1 + \frac{1}{x_1}; $$

$$ x_1 = \frac{1}{\sqrt{2} - 1} = \sqrt{2} + 1 = 2 + \frac{1}{x_2}; $$

$$ x_2 = \frac{1}{\sqrt{2} - 1} = \sqrt{2} + 1. $$

We find that $x_2 = x_1$. Consequently, everything will repeat
itself from this point on: $x_3 = x_2$, $x_4 = x_3$, ... . We successively
obtain

$$ \sqrt{2} = 1 + \frac{1}{x_1} = 1 + \cfrac{1}{2 + \cfrac{1}{x_2}} = 1 + \cfrac{1}{2 + \cfrac{1}{2 + \cfrac{1}{x_3}}} = \ldots $$

As long as we give a finite expression for $\sqrt{2}$ (involving an
irrational $x_n$), we can use the equality sign. If this process is
continued indefinitely, we obtain

$$ \sqrt{2} \sim [1; 2, 2, 2, \ldots], $$

that is, the number $\sqrt{2}$ corresponds to a nonterminating continued
fraction. We cannot put the equality sign between
$\sqrt{2}$ and the nonterminating continued fraction [1; 2, 2, 2, ...]
because we are as yet unable to transform from one of these
notations to another in both directions. So far the symbol of
nonterminating continued fraction is devoid of meaning. We
shall discuss and solve this problem in Chapter VI.
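The first terms of such an expansion can also be generated mechanically. The sketch below does not follow the derivation of the text step by step; it uses a standard integer recurrence for square roots, in which every number $x_k$ is kept in the form $(m + \sqrt{n})/d$ with integers m and d. The function name sqrt_terms is hypothetical.

```python
import math


def sqrt_terms(n, count):
    """First `count` terms of the continued fraction of sqrt(n).

    n must be a positive integer that is not a perfect square.
    Every complete quotient has the form (m + sqrt(n)) / d with
    integers m and d, so no floating-point arithmetic is needed.
    """
    a0 = math.isqrt(n)
    if a0 * a0 == n:
        raise ValueError("n is a perfect square; the expansion terminates")
    m, d, a = 0, 1, a0
    terms = [a0]
    while len(terms) < count:
        m = d * a - m
        d = (n - m * m) // d     # the division is always exact
        a = (a0 + m) // d
        terms.append(a)
    return terms


print(sqrt_terms(2, 6))   # [1, 2, 2, 2, 2, 2]
```

Working with integers avoids the rounding errors that would sooner or later spoil the terms if $\sqrt{2}$ were replaced by a decimal approximation.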

Example 2. In geometrical problems we can expand a geometrical
quantity into a continued fraction without knowing
the numerical value of the quantity. For example, let us find
the ratio of the base to a leg of an isosceles triangle with the
108° vertex angle.

[Fig. 6. Triangle ABC]

The angles of triangle ABC (Fig. 6) are 108°, 36°, 36°. We
mark off $BB_1 = b$ (obviously, b can be marked off only once
because a < 2b). We find

$$ \frac{a}{b} = \frac{BC}{BB_1} = \frac{BB_1 + B_1C}{BB_1} = 1 + \frac{B_1C}{BB_1} = 1 + \frac{1}{x_1}, $$

$$ x_1 = \frac{BB_1}{B_1C} = \frac{AC}{B_1C}. $$

However, triangle $B_1AC$ is similar to the initial triangle
ABC. The first line above determined the ratio a/b of the base
to the leg. The second line represents the same problem because
$x_1$ is the ratio of the base to the leg in a triangle of the
same shape. The process cannot terminate because the first
step resulted in the reproduction of the initial situation. We
can write

$$ \frac{a}{b} \sim [1; 1, 1, \ldots]. $$

Likewise, it can be shown that

$$ \frac{b}{a} \sim [0; 1, 1, \ldots]. $$

We shall return to this result at the end of Sec. 4.2.2.

Example 3. Expand the ratio of the diagonal of a square
to its side into a continued fraction.

This example is more complicated than the second one.
There we returned to the initial state after the first step, and
here two steps are required.

If we use the fact that $d/a = \sqrt{2}$, this example coincides with
Example 1. But the expansion of the ratio d/a into a continued
fraction can be obtained by geometrical means, without the
numerical information.

Initial situation: we mark off the side along the diagonal,
which can be done only once. We find (Fig. 7):

$$ \frac{d}{a} = \frac{CA}{CB} = \frac{CB_1 + B_1A}{CB} = 1 + \frac{1}{x_1}, $$

$$ x_1 = \frac{CB}{B_1A} = \frac{AB}{AB_1}. $$

Now we erect $B_1B_2 \perp AC$. Then $BB_2 = B_1B_2$ (please, prove
it yourself). Now we supplement triangle $AB_1B_2$ to a square
(merely for the sake of illustrative clarity; this is not necessary
for the proof), and mark off $AB_1$ along BA. Having marked
it off once, we obtain $BB_2$, the remainder being $B_2A$. Now we
must mark off $AB_1$ along $B_2A$, but this is a repetition of the
initial situation: the side of a square is marked off along the
diagonal. Hence, the process is infinite, that is

$$ x_1 = 2 + \frac{1}{x_1}; $$

$$ \frac{d}{a} \sim [1; 2, 2, 2, \ldots]. $$

It can be easily shown that

$$ \frac{a}{d} \sim [0; 1, 2, 2, \ldots] $$

(see Subsec. 4.2.2).


2.2. Euclid’s Algorithm 

2.2.1. Euclid’s Algorithm. The preceding Section dealt
with the algorithm of the expansion of real numbers into
continued fractions. This algorithm consisted of two alternating
steps: (1) separation of the integral part of the number,
and (2) presentation of the remainder (which is less than unity)
as a reciprocal of a number greater than unity. This algorithm
is a particular case of Euclid’s algorithm which is
widely used in mathematics.

Let us first illustrate how Euclid’s algorithm operates in
finding the greatest common divisor (abbreviated to GCD)
of two natural numbers.
Let p and q be natural numbers. Euclid’s algorithm consists
of the following steps*:

    Divide             Quotient     Remainder

    p by q             $a_0$        $r_0$
    q by $r_0$         $a_1$        $r_1$
    $r_0$ by $r_1$     $a_2$        $r_2$
    ...

The following formulas describe the process:

$$
\begin{aligned}
p &= a_0 q + r_0 && (0 < r_0 < q); \\
q &= a_1 r_0 + r_1 && (0 < r_1 < r_0); \\
r_0 &= a_2 r_1 + r_2 && (0 < r_2 < r_1); \\
r_1 &= a_3 r_2 + r_3 && (0 < r_3 < r_2); \\
&\;\;\vdots \\
r_{s-3} &= a_{s-1} r_{s-2} + r_{s-1} && (0 < r_{s-1} < r_{s-2}); \\
r_{s-2} &= a_s r_{s-1}.
\end{aligned}
\tag{3}
$$

Clarification. If the dividend and the divisor are integers,
the remainder may be zero. Why then do we exclude the
equality sign on the left, that is, why do we write, e.g.
$0 < r_1 < r_0$ instead of $0 \le r_1 < r_0$? Because if it so happens
that $r_i = 0$, this equality terminates the sequence. The
algorithm cannot but stop because the remainders $r_0, r_1, r_2, \ldots$
are non-negative integers, and each next r is less than the
preceding one. Hence, the remainder will necessarily become
zero at some step.

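Equalities (3) can be rewritten as follows:

$$
\begin{aligned}
\frac{p}{q} &= a_0 + \frac{r_0}{q}; \\
\frac{q}{r_0} &= a_1 + \frac{r_1}{r_0}; \\
\frac{r_0}{r_1} &= a_2 + \frac{r_2}{r_1}; \\
&\;\;\vdots \\
\frac{r_{s-3}}{r_{s-2}} &= a_{s-1} + \frac{r_{s-1}}{r_{s-2}}; \\
\frac{r_{s-2}}{r_{s-1}} &= a_s,
\end{aligned}
$$

where $r_{s-1}$ is what we seek: the GCD of p and q.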


* If p < q, then $a_0 = 0$. This in no way disrupts the process.
Each of these equalities with the exception of the last one
is an improper fraction presented as a sum of an integer and a
proper fraction. Note that the left-hand side of each equality
(beginning with the second one) is the reciprocal of the proper
fraction of the preceding equality. We can, therefore, eliminate
all $r_i$ successively. By replacing the fraction $r_0/q$ in the first
equality by its expression from the second equality, we find

$$ \frac{p}{q} = a_0 + \cfrac{1}{a_1 + \cfrac{r_1}{r_0}}. $$

The fraction $r_1/r_0$ in the equality obtained above is now
replaced with its expression from the third equality:

$$ \frac{p}{q} = a_0 + \cfrac{1}{a_1 + \cfrac{1}{a_2 + \cfrac{r_2}{r_1}}}. $$

By continuing this process, we shall finally expand p/q into
a continued fraction. However, there is no need to repeat
these substitutions each time. Indeed, $a_0, a_1, a_2, \ldots, a_s$ are
the terms of the sought continued fraction. It only remains
for us to remember the following rule:

In order to expand p/q into a continued fraction, apply Euclid's
algorithm to the numbers p and q. The quotients obtained in
the successive divisions are the terms of the sought continued
fraction.


Example. Expand the fraction 61/27 into a continued fraction.

    61 = 2 · 27 + 7
    27 = 3 · 7 + 6
     7 = 1 · 6 + 1
     6 = 6 · 1 + 0

Hence, 61/27 = [2; 3, 1, 6].
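The rule can be tried out with a few lines of Python. The sketch is only an illustration (the function name euclid is hypothetical); it relies on the built-in divmod, which returns the quotient and the remainder of an integer division.

```python
def euclid(p, q):
    """Run Euclid's algorithm on natural numbers p and q.

    Returns (quotients, gcd): the successive quotients a_0, a_1, ..., a_s
    and the last non-zero remainder, which is the GCD of p and q.
    """
    quotients = []
    while q != 0:
        a, r = divmod(p, q)      # p = a*q + r with 0 <= r < q
        quotients.append(a)
        p, q = q, r              # next step divides the old divisor by r
    return quotients, p


print(euclid(61, 27))     # ([2, 3, 1, 6], 1)
```

The list of quotients is the continued fraction of p/q, and the second value is the GCD, so one run of the algorithm answers both questions at once.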


2.2.2. Examples of Application of Euclid’s Algorithm. 
Euclid’s algorithm can be used not only to find the GCD of 
two natural numbers. Let p and q be elements of an arbitrary 
set in which division with remainder* has been defined. 
Euclid’s algorithm can then be employed. 


* This means that each ordered pair of elements p and q (p is the 
dividend and q the divisor) is put in correspondence with an ordered 
pair a and r (a is the quotient and r the remainder) which satisfies 
For example, if p and q are segments of the number line,
Euclid’s algorithm can be applied to find their common measure.
If p and q are commensurate, Euclid’s algorithm terminates
and the segment $r_{s-1}$ [see formulas (3)] is their common
measure. Indeed, it follows from the last equality in (3) that
$r_{s-1}$ is contained in $r_{s-2}$ an integral number of times. By
substituting $r_{s-2}$ into the last-but-one equality, we obtain

$$ r_{s-3} = a_{s-1} a_s r_{s-1} + r_{s-1} = (a_{s-1} a_s + 1)\, r_{s-1}. $$

Hence, $r_{s-3}$ is also an integral multiple of $r_{s-1}$. If we thus
climb the ladder of formulas (3) one step at a time, we reach
the first two lines, that is, we prove that both p and q are
integral multiples of $r_{s-1}$, and therefore $r_{s-1}$ is the common
measure of p and q.

In addition, Euclid’s algorithm yields the terms of the
continued fraction which corresponds to the ratio p/q. If the
segments p and q are incommensurate, Euclid’s algorithm
does not stop, and the numbers $a_0; a_1, a_2, \ldots$ are the terms
of the nonterminating continued fraction which represents
the ratio p/q.

Euclid’s algorithm is also applicable to polynomials of 
one variable x. The phrase “is less than” then means “is of 
lower power”. By using the algorithm, we can find the GCD 
of two polynomials; however, this result has no bearing on 
our topic. 

2.2.3. Summary. This chapter has outlined an algorithm 
(in its two versions) that permits to expand any real number 
a into a continued fraction, that is, to find a continued frac¬ 
tion corresponding to the number a. 

If a is a rational number, it corresponds to a terminating 
continued fraction. In this case the calculations can be car¬ 
ried out in reverse order, that is, we can find the value of the 
continued fraction. For example, 


1 _ 0 , 7 _ 61 

o . 6 27 “ 27 • 

3 + - 


the conditions p = aq + r, r < q. Obviously, the operation of mul¬ 
tiplication and the relation “is less than” must also be defined on this 
set. 
Therefore, instead of the sentence “the continued fraction [2; 3, 1, 6] corresponds to the number $\frac{61}{27}$” we can say that “the number $\frac{61}{27}$ equals the continued fraction [2; 3, 1, 6],” or to be even more precise we can say that $\frac{61}{27}$ and [2; 3, 1, 6] are two different notations for the same number.
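
The reverse calculation for a terminating continued fraction can be sketched as follows. This is an illustration only: the helper name `evaluate` is hypothetical, and all terms after the first are assumed to be natural numbers so that no division by zero occurs.

```python
from fractions import Fraction


def evaluate(terms):
    """Value of the terminating continued fraction [a0; a1, ..., as].

    Works from the last term backwards, exactly as one would
    collapse the multistorey fraction by hand.
    """
    value = Fraction(terms[-1])
    for a in reversed(terms[:-1]):
        value = a + 1 / value
    return value


print(evaluate([2, 3, 1, 6]))  # 61/27
```

Exact rational arithmetic (`Fraction`) is used instead of floating point so that the result is the fraction itself and not a decimal approximation.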

If, however, $\alpha$ is an irrational number, the situation is completely different. In this case the correspondence between $\alpha$ and the continued fraction is defined in one direction only: the number $\alpha$ corresponds to a nonterminating continued fraction, but not vice versa. We cannot determine the value of a nonterminating continued fraction by the same procedure as used to calculate [2; 3, 1, 6]. So far we do not know the meaning of nonterminating continued fractions.

We are yet to solve this problem. It will be shown in Chapter 5 how to give meaning to a nonterminating continued fraction. The reader should keep in mind, when reading Chapters 3 and 4, that so far we are unaware of that.



Chapter 3 

Convergents 


3.1. The Concept of Convergents 

3.1.1. Preliminary Definition of Convergents. A continued fraction can be terminated by retaining the terms $a_0; a_1, a_2, \dots, a_n$ and dropping all subsequent terms $a_{n+1}, a_{n+2}, \dots$ The number obtained by this operation is called the nth convergent and denoted by $\frac{p_n}{q_n}$:

$$\frac{p_n}{q_n} = [a_0; a_1, a_2, \dots, a_n] = a_0 + \cfrac{1}{a_1 + \cfrac{1}{a_2 + \dots + \cfrac{1}{a_n}}}.$$

Thus, for n = 0 we obtain the 0th convergent $\frac{p_0}{q_0} = [a_0] = \frac{a_0}{1}$.

Note 1. This is not the ultimate definition of a convergent. 
It will be defined in Subsec. 3.1.3. 

Note 2. The concept of convergents is applicable both to terminating and nonterminating continued fractions. In the case of a terminating continued fraction we come to the last convergent coinciding with the continued fraction itself. For example, for the number $\frac{61}{27}$ we find

$$
\begin{aligned}
\frac{p_0}{q_0} &= \frac{2}{1};\\
\frac{p_1}{q_1} &= [2; 3] = \frac{7}{3};\\
\frac{p_2}{q_2} &= [2; 3, 1] = \frac{9}{4};\\
\frac{p_3}{q_3} &= [2; 3, 1, 6] = \frac{61}{27}.
\end{aligned}
$$


If a continued fraction is nonterminating, the sequence of convergents is infinite. We do not know yet the meaning carried by nonterminating continued fractions, but this fact does not present obstacles to understanding convergents. For example, for the fraction [1; 2, 2, 2, ...] the sequence is

$$
\begin{aligned}
\frac{p_0}{q_0} &= \frac{1}{1};\\
\frac{p_1}{q_1} &= [1; 2] = \frac{3}{2};\\
\frac{p_2}{q_2} &= [1; 2, 2] = \frac{7}{5};\\
&\dots
\end{aligned}
$$


Hint. It is this fact (the possibility of forming the convergents) that enables us to breathe meaning into nonterminating continued fractions and assume that convergents are successive approximations which determine the value of the nonterminating continued fraction.

This hint is a seed from which the theory will grow. We develop this hint where we (in Chapter 4) prove that the convergents of terminating continued fractions are successive approximations, and for the time being we check it for the number $\frac{61}{27}$. In order to evaluate the approximations, note that $\frac{61}{27} \approx 2.259$.


Approximation                      Error
number      value

1           2/1   = 2               0.259
2           7/3   = 2.333          -0.074
3           9/4   = 2.250           0.009
4           61/27 = 2.259           0

We note that the errors form a sequence with terms which 
have alternating signs and decreasing magnitude. Further on 
this pattern will be shown to constitute the general rule. 
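
The table of errors can be reproduced with a few lines of Python. This is a sketch for checking the numbers, not part of the original text; the helper name `truncations` is hypothetical.

```python
from fractions import Fraction


def truncations(terms):
    """Values of [a0], [a0; a1], ..., [a0; a1, ..., as], evaluated directly."""
    values = []
    for n in range(1, len(terms) + 1):
        value = Fraction(terms[n - 1])
        for a in reversed(terms[:n - 1]):
            value = a + 1 / value
        values.append(value)
    return values


alpha = Fraction(61, 27)
values = truncations([2, 3, 1, 6])
errors = [alpha - v for v in values]   # error = exact value minus approximation

for number, (v, e) in enumerate(zip(values, errors), start=1):
    print(number, v, f"{float(e):+.3f}")
```

The errors are computed exactly as fractions and only rounded for printing, so the alternation of signs is not an artefact of rounding.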

3.1.2. How to Generate Convergents. There is no need to write out the whole multistorey continued fraction and carry out the cumbersome process of successive evaluations if we want to find the nth convergent. Quite simple recursion formulas* exist for evaluating $p_n$ and $q_n$. Obviously,

$$\frac{p_0}{q_0} = \frac{a_0}{1}; \qquad \frac{p_1}{q_1} = a_0 + \frac{1}{a_1} = \frac{a_1 a_0 + 1}{a_1}.$$

In order to go from $\frac{p_1}{q_1}$ to $\frac{p_2}{q_2}$ we have to replace $a_1$ by $a_1 + \frac{1}{a_2}$. Elementary algebraic manipulations give

$$\frac{p_2}{q_2} = \frac{a_2 (a_1 a_0 + 1) + a_0}{a_2 a_1 + 1}.$$

A careful look at this formula reveals the following structure:

$$\frac{p_2}{q_2} = \frac{p_1 a_2 + p_0}{q_1 a_2 + q_0}.$$

This formula reveals the general rule. Let us write it by giving separately the numerator and the denominator of the nth convergent:

$$
\left.
\begin{aligned}
p_n &= p_{n-1} a_n + p_{n-2},\\
q_n &= q_{n-1} a_n + q_{n-2},\\
n &= 2, 3, \dots, s.
\end{aligned}
\right\} \tag{4}
$$

Before proving formulas (4), let us first clarify their meaning. We shall not assign individual meanings to $p_n$ and $q_n$ (although this restriction will be dropped in the next Subsection). Formulas (4) are to be interpreted as follows: we may take $p_n$ and $q_n$, as well as values proportional to them, as the numerator and denominator of the nth convergent.

►** Let us prove formulas (4) by mathematical induction. Assume that they hold for a certain fixed n that we denote by k:

$$
\left.
\begin{aligned}
p_k &= p_{k-1} a_k + p_{k-2},\\
q_k &= q_{k-1} a_k + q_{k-2},
\end{aligned}
\right\} \tag{5}
$$

and then prove that (4) hold for n = k + 1.

* A formula expressing an arbitrary element of a sequence in terms of one or several preceding elements is said to be a recursion formula. Thus, the nth term of a geometric progression is given by a recursion formula $u_n = u_{n-1} q$ or by a non-recursion formula $u_n = u_1 q^{n-1}$. The recursion formula does not allow us to evaluate $u_n$ immediately; we have to find successively $u_1, u_2, \dots, u_n$.

** The symbol ► marks the beginning of a proof.






An analysis of the expressions

$$\frac{p_k}{q_k} = a_0 + \cfrac{1}{a_1 + \dots + \cfrac{1}{a_k}}, \qquad \frac{p_{k+1}}{q_{k+1}} = a_0 + \cfrac{1}{a_1 + \dots + \cfrac{1}{a_k + \cfrac{1}{a_{k+1}}}}$$

shows that in order to go from $\frac{p_k}{q_k}$ to $\frac{p_{k+1}}{q_{k+1}}$, it is necessary to replace $a_k$ by $a_k + \frac{1}{a_{k+1}}$. Let us carry out this substitution in formulas (5). Note that $p_{k-2}$, $q_{k-2}$, $p_{k-1}$, and $q_{k-1}$ remain unchanged because they do not contain $a_k$. We have

$$
\begin{aligned}
p_{k+1} &= p_{k-1}\left(a_k + \frac{1}{a_{k+1}}\right) + p_{k-2} = \frac{1}{a_{k+1}}\left[(p_{k-1} a_k + p_{k-2})\, a_{k+1} + p_{k-1}\right],\\
q_{k+1} &= q_{k-1}\left(a_k + \frac{1}{a_{k+1}}\right) + q_{k-2} = \frac{1}{a_{k+1}}\left[(q_{k-1} a_k + q_{k-2})\, a_{k+1} + q_{k-1}\right].
\end{aligned}
$$

Since $p_{k+1}$ and $q_{k+1}$ are defined to within a proportionality factor, we shall drop the factor $1/a_{k+1}$ and replace the expressions in parentheses via formulas (5):

$$
\begin{aligned}
p_{k+1} &= p_k a_{k+1} + p_{k-1},\\
q_{k+1} &= q_k a_{k+1} + q_{k-1}.
\end{aligned}
$$

We have therefore arrived at formulas (5) with k + 1 substituted for k.

Furthermore, we have already seen that formulas (4) hold for n = 2. We have thus proved that they also hold for n = 2, 3, ..., s. ■*

* The symbol ■ marks the end of a proof.
3.1.3. The Final Definition of Convergents. We are now ready to change the meaning of the term “convergent”. We assume that the convergents of orders zero and one are the fractions $\frac{p_0}{q_0}$ and $\frac{p_1}{q_1}$, respectively, with $p_0 = a_0$, $q_0 = 1$, $p_1 = a_0 a_1 + 1$, $q_1 = a_1$. Also, we assume that the convergents of orders 2, 3, ..., s are the fractions whose numerators and denominators are given by formulas (4) for n = 2, 3, ..., s.

The reader may not have noticed the change. Did we really 
use a different interpretation of convergents? 

The point is that the same number can be written in different ways. For example, notations 0.5, $\frac{1}{2}$, and $\frac{2}{4}$ stand for the same number. Until now the term “nth convergent” has been interpreted as a certain number regardless of notation.

Thus, different answers could be given in the Example of Subsection 2.1.2 to the question “What is the second convergent of $\frac{61}{27}$?”: $2\frac{1}{4}$, 2.25, $\frac{9}{4}$, $\frac{18}{8}$, and so forth. All of them represent the same number in different notations. However, from now on the term “convergent” will mean for us not only a definite number but also a prescribed notation of this number. Thus, we decide that the convergent for $\frac{61}{27}$ is $\frac{9}{4}$ while $\frac{18}{8}$ would be an incorrect answer*. Now both the numerator and denominator of each convergent are strictly defined, not to within a proportionality factor (in the example above, $p_2 = 9$, $q_2 = 4$).

This convention is very important for the further elaboration of the theory of continued fractions.

If we note that all letters in formulas (4) stand for natural numbers, it will be easy to understand that the denominators (as well as numerators) of successive convergents strictly increase, that is, $q_n > q_{n-1}$, $p_n > p_{n-1}$ (n = 2, 3, ...). A comparison of $p_0$, $q_0$ with $p_1$, $q_1$ leads to

$$p_1 = p_0 a_1 + 1, \qquad q_0 = 1, \qquad q_1 = a_1,$$

whence $p_1 > p_0$, while $q_0$ may prove to equal $q_1$. Finally,

$$
\left.
\begin{aligned}
q_0 &\le q_1 < q_2 < q_3 < \dots;\\
p_0 &< p_1 < p_2 < p_3 < \dots
\end{aligned}
\right\} \tag{6}
$$

* Note, in passing, that $\frac{9}{4}$ and $\frac{18}{8}$ are indeed different fractions, even though they represent the same rational number.

Sequences (6) can be finite or infinite, depending on whether the continued fraction generating them is terminating or nonterminating.

3.1.4. Evaluation of Convergents. We give now a convenient way of arranging the results when evaluating convergents. The values of $a_i$ will be written in the first, $p_i$ in the second, and $q_i$ in the third rows.

a_0   a_1   a_2   a_3   ...   a_{s-1}   a_s
p_0   p_1   p_2   p_3   ...   p_{s-1}   p_s
q_0   q_1   q_2   q_3   ...   q_{s-1}   q_s

We begin with filling in the first row and the first two columns. The new entries are evaluated in the following order:

...   a_{n-2}   a_{n-1}   a_n
...   p_{n-2}   p_{n-1}
...   q_{n-2}   q_{n-1}

(1) the column $(p_{n-1}, q_{n-1})$ is multiplied by $a_n$; (2) the obtained column is added to the preceding one.

The same scheme is recommended if we want to calculate the value of a terminating continued fraction: the answer is given by the last column $(p_s, q_s)$. This is much simpler than the step-by-step procedure.

The reader can practice filling in the table for a continued fraction [0; 3, 14, 1, 2, 5].

a_i |  0    3    14    1     2     5
p_i |  0    1    14    15    44    235
q_i |  1    3    43    46    135   721

3.1.5. Complete Quotients. It is often necessary to terminate the process of expanding a number into a continued fraction before the end is reached. For example,

$$\frac{61}{27} = 2 + \cfrac{1}{\frac{27}{7}}$$

or

$$\frac{61}{27} = 2 + \cfrac{1}{3 + \cfrac{1}{\frac{7}{6}}}.$$

The numbers $\frac{27}{7}$ and $\frac{7}{6}$ in the expressions above are called the complete quotients (the definition will be given below). The following notation is in use:

$$\frac{61}{27} = \left[2 \,\Big|\, \tfrac{27}{7}\right] = \left[2; 3 \,\Big|\, \tfrac{7}{6}\right],$$

that is, the complete quotient is separated from the preceding terms by a vertical bar.

The complete quotient $\alpha_n$ can be defined in the following manner:

$$\alpha = a_0 + \cfrac{1}{a_1 + \cfrac{1}{a_2 + \dots + \cfrac{1}{a_{n-1} + \cfrac{1}{\alpha_n}}}}, \tag{7}$$

where

$$\alpha_n = a_n + \cfrac{1}{a_{n+1} + \cfrac{1}{a_{n+2} + \dots}}. \tag{8}$$

The complete quotient is thus a continued fraction which begins not with $a_0$ but with an arbitrary element $a_n$, that is, the continued fraction whose n terms, from $a_0$ to $a_{n-1}$, have been cut off. The symbolic notation of (7) is

$$\alpha = [a_0; a_1, a_2, \dots, a_{n-1} \mid \alpha_n]. \tag{9}$$

Complete quotients possess the following property: if some two complete quotients coincide, that is, $\alpha_n = \alpha_{n+k}$ (k > 0), then, first, this coincidence repeats itself after every k steps:

$$\alpha_n = \alpha_{n+k} = \alpha_{n+2k} = \dots = \alpha_{n+mk} = \dots$$

and, second, the continued fraction itself is nonterminating and periodic (repeating).

The proof is obvious, and need not be given here. We shall only outline the first step. When we apply the algorithm of expanding a number $\alpha$ into a continued fraction and come to some complete quotient $\alpha_n$, our subsequent steps are independent of the preceding steps, that is, of the terms $a_0, a_1, a_2, \dots, a_{n-1}$. Therefore, the terms that follow $a_{n+k-1}$ in the continued fraction are the same that follow $a_{n-1}$.

If $\alpha_n$ is a natural number, then $\alpha_n = a_n$, and the vertical bar in equality (9) can be replaced with a comma. It is logical to set $\alpha_0 = \alpha$.

The continued fraction (8) can be a terminating or a nonterminating one. The meaning of nonterminating continued fractions will be clarified in Chapter 4, so that for the time being we deal with (8) formally.

We are able now to derive a formula that relates complete quotients to convergents. A careful look at formula (7) reveals: if we drop $\frac{1}{\alpha_n}$ in the right-hand side, the remaining part of $\alpha$ is a convergent $\frac{p_{n-1}}{q_{n-1}}$ that can be rewritten, by using formulas (4), as follows:

$$\frac{p_{n-1}}{q_{n-1}} = \frac{p_{n-2} a_{n-1} + p_{n-3}}{q_{n-2} a_{n-1} + q_{n-3}}.$$

If we replace in this formula $a_{n-1}$ by $a_{n-1} + \frac{1}{\alpha_n}$, we convert the left-hand side into $\alpha$:

$$\alpha = \frac{p_{n-2}\left(a_{n-1} + \frac{1}{\alpha_n}\right) + p_{n-3}}{q_{n-2}\left(a_{n-1} + \frac{1}{\alpha_n}\right) + q_{n-3}} = \frac{(p_{n-2} a_{n-1} + p_{n-3})\,\alpha_n + p_{n-2}}{(q_{n-2} a_{n-1} + q_{n-3})\,\alpha_n + q_{n-2}}.$$

Finally,

$$\alpha = \frac{p_{n-1}\alpha_n + p_{n-2}}{q_{n-1}\alpha_n + q_{n-2}}.$$
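
The relation between complete quotients and convergents can be checked numerically for the number 61/27. The sketch below is an illustration only; the helper name `complete_quotients` is hypothetical, and the check covers the orders n >= 2 for which the recursion formulas (4) were written.

```python
from fractions import Fraction


def complete_quotients(alpha, limit):
    """At most `limit` complete quotients alpha_0, alpha_1, ... of a rational alpha."""
    result = []
    for _ in range(limit):
        result.append(alpha)
        fractional = alpha - (alpha.numerator // alpha.denominator)
        if fractional == 0:          # the expansion has terminated
            break
        alpha = 1 / fractional
    return result


alpha = Fraction(61, 27)
quotients = complete_quotients(alpha, 10)                    # 61/27, 27/7, 7/6, 6
terms = [x.numerator // x.denominator for x in quotients]    # 2, 3, 1, 6

# Numerators and denominators of the convergents from formulas (4).
p = [terms[0], terms[0] * terms[1] + 1]
q = [1, terms[1]]
for a in terms[2:]:
    p.append(p[-1] * a + p[-2])
    q.append(q[-1] * a + q[-2])

for n in range(2, len(terms)):
    rebuilt = (p[n - 1] * quotients[n] + p[n - 2]) / (q[n - 1] * quotients[n] + q[n - 2])
    print(n, quotients[n], rebuilt)
```

For every n the rebuilt value equals 61/27, as the final formula predicts.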


3.2. The Properties of Convergents 

3.2.1. The Difference Between Two Neighbouring Convergents. The step from the nth convergent to the (n + 1)th is the increment of the nth convergent, denoted by $\Delta_n$:

$$\Delta_n = \frac{p_{n+1}}{q_{n+1}} - \frac{p_n}{q_n} = \frac{p_{n+1} q_n - p_n q_{n+1}}{q_n q_{n+1}} = \frac{D_n}{q_n q_{n+1}}, \tag{10}$$

where $D_n$ denotes the numerator,

$$D_n = p_{n+1} q_n - p_n q_{n+1}. \tag{11}$$

Let us reduce by 1 the subscripts of $p_{n+1}$ and $q_{n+1}$ using formulas (4):

$$D_n = (p_n a_{n+1} + p_{n-1})\, q_n - p_n (q_n a_{n+1} + q_{n-1}) = -(p_n q_{n-1} - p_{n-1} q_n).$$

The expression in parentheses is of the same type as (11) but all subscripts are less by 1. Hence, it equals $D_{n-1}$:

$$D_n = -D_{n-1}.$$

This recursion relation enables us to reduce the subscript to zero:

$$D_n = -D_{n-1} = D_{n-2} = -D_{n-3} = \dots = (-1)^n D_0.$$

The only step which separates us from total success is the direct evaluation of $D_0$:

$$D_0 = p_1 q_0 - p_0 q_1 = (a_1 a_0 + 1) \cdot 1 - a_0 a_1 = 1.$$

Hence,

$$D_n = p_{n+1} q_n - p_n q_{n+1} = (-1)^n, \tag{12}$$

and by virtue of (10),

$$\Delta_n = \frac{p_{n+1}}{q_{n+1}} - \frac{p_n}{q_n} = \frac{(-1)^n}{q_n q_{n+1}}. \tag{13}$$
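
Formulas (12) and (13) are easy to check on a concrete example. The sketch below, which is an illustration and not part of the original text, uses the continued fraction [0; 3, 14, 1, 2, 5] from Subsection 3.1.4; the variable names are chosen here for convenience.

```python
from fractions import Fraction

terms = [0, 3, 14, 1, 2, 5]

# Numerators and denominators of the convergents from formulas (4).
p = [terms[0], terms[0] * terms[1] + 1]
q = [1, terms[1]]
for a in terms[2:]:
    p.append(p[-1] * a + p[-2])
    q.append(q[-1] * a + q[-2])

# D_n = p_{n+1} q_n - p_n q_{n+1}, formula (11).
d = [p[n + 1] * q[n] - p[n] * q[n + 1] for n in range(len(terms) - 1)]
print(d)  # [1, -1, 1, -1, 1]

# Increments of the convergents, formula (10).
deltas = [Fraction(p[n + 1], q[n + 1]) - Fraction(p[n], q[n])
          for n in range(len(terms) - 1)]
print(deltas)
```

The alternating values of $D_n$ are what make the increments alternate in sign.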


3.2.2. Comparison of Neighbouring Convergents. Let us 
mention some important properties of convergents. 

Property 1. Each odd-numbered convergent is greater than the 
neighbouring convergents. Each even-numbered convergent is 
less than its neighbours. 

When this regularity is tested, one must bear in mind that 
the zeroth convergent and the last one (provided it exists) 
have only one neighbour each. 

Formula (13) immediately proves that Property 1 holds. 

Property 1 indicates that successive convergents are alternately greater or less than their immediate predecessors.

Property 2. The difference between neighbouring convergents decreases in absolute value as the number of the convergent increases.

► Let us compare

$$|\Delta_n| = \frac{1}{q_n q_{n+1}}, \qquad |\Delta_{n+1}| = \frac{1}{q_{n+1} q_{n+2}}.$$

Sequence (6) gives $q_{n+2} > q_n$. Hence, the denominator of the second fraction is greater, and the fraction itself is smaller:

$$|\Delta_{n+1}| < |\Delta_n|.$$ ■


Property 3. The exact value of a terminating continued fraction $\alpha$ is between any two neighbouring convergents. All even-numbered convergents lie to the left of $\alpha$, that is, they give an approximation of $\alpha$ by defect. All odd-numbered convergents lie to the right of $\alpha$, that is, they give an approximation of $\alpha$ by excess.

It is obvious that we have to exclude the last convergent which is exactly equal to $\alpha$.
Instead of a formal proof, we shall only illustrate its main idea in Fig. 8.

Fig. 8

Figure 8 shows how the convergents are arranged on the number line. The numeral marks not the magnitude but the number of each convergent. The leftmost point is the No. 0 convergent (i.e. the integral part of the continued fraction). A step rightward is necessary to move from No. 0 to No. 1. This step (i.e. $\Delta_0$) is shown by the upper arc. In order to move from convergent No. 1 to convergent No. 2 we need to step leftward, but this step (i.e. $\Delta_1$) is shorter than $\Delta_0$; then we make another step, and so on. We make steps alternately to the right and to the left, and each next step is shorter than the preceding one. Figure 8 convinces us that Property 3 holds.

Figure 9 is another illustration of the relative arrangement of the successive convergents. The abscissa axis gives the number of a convergent, and the ordinate axis presents its magnitude. The dashed line is drawn at the level of the value of $\alpha$.

Fig. 9

Property 4. The absolute error of approximating the number $\alpha$ by a convergent $\frac{p_n}{q_n}$ is less than $\frac{1}{q_n^2}$, that is,

$$\left|\alpha - \frac{p_n}{q_n}\right| < \frac{1}{q_n^2}. \tag{14}$$

Indeed, by virtue of Property 3 and formula (13), we find

$$\left|\alpha - \frac{p_n}{q_n}\right| < \frac{1}{q_n q_{n+1}}.$$

This estimate is inconvenient because when approximating $\alpha \approx \frac{p_n}{q_n}$ we may not know the next convergent. For this reason we replace $q_{n+1}$ in the last inequality by a smaller number $q_n$; this can only enlarge the right-hand side, so the inequality remains valid. This proves inequality (14). ■

Property 4 shows that convergents are very good for approximating real numbers. If the fraction $\frac{p_n}{q_n}$ were not a convergent, the absolute error would be $\left|\alpha - \frac{p_n}{q_n}\right| \le \frac{1}{2 q_n}$.
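
Inequality (14) can be checked directly for the convergents of 61/27. The sketch below is only an illustration with names chosen here; it compares the exact error with the bound $1/q_n^2$ for each convergent.

```python
from fractions import Fraction

alpha = Fraction(61, 27)
terms = [2, 3, 1, 6]

# Numerators and denominators of the convergents from formulas (4).
p = [terms[0], terms[0] * terms[1] + 1]
q = [1, terms[1]]
for a in terms[2:]:
    p.append(p[-1] * a + p[-2])
    q.append(q[-1] * a + q[-2])

for pn, qn in zip(p, q):
    error = abs(alpha - Fraction(pn, qn))
    bound = Fraction(1, qn ** 2)
    print(f"{pn}/{qn}: error {error}, bound {bound}, holds: {error < bound}")
```

For the convergent 9/4, for instance, the error is 1/108 while the bound is 1/16, so the convergent is considerably better than the bound requires.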

3.2.3. Irreducibility of Convergents. Let us consider one 
more property of convergents. 

Property 5. All convergents are irreducible to lower terms.

The numerators and denominators of convergents are given by formulas (4). Let us assume that the fraction $\frac{p_n}{q_n}$ can be reduced to lower terms, that is, its numerator and denominator have a common factor $\lambda$ distinct from unity:

$$p_n = \lambda p'_n, \qquad q_n = \lambda q'_n,$$

where $p'_n$ and $q'_n$ are natural numbers. Then formula (12) yields

$$\lambda\, (p_{n+1} q'_n - p'_n q_{n+1}) = (-1)^n.$$

This equality is absurd since the left-hand side is divisible by $\lambda$, while the right-hand side is not. Consequently, $\frac{p_n}{q_n}$ is not reducible to lower terms.

A set of mutually equal fractions contains only one irreducible fraction. The convergent may thus be defined as follows: the convergent is the fraction, irreducible to lower terms, which gives the value of a truncated continued fraction.



Chapter 4 

Nonterminating 
Continued Fractions 


4.1. Real Numbers 


4.1.1. The Gulf Between the Finite and the Infinite. We can evaluate terminating continued fractions and hope that the reader does want to learn how to deal with nonterminating ones. It is precisely such desires that energize scientific progress.

Any rational number can be converted to a terminating continued fraction. Conversely, any terminating continued fraction represents a rational number. Could it be that nonterminating continued fractions provide the means of representing irrational numbers?

Quite a few mathematical concepts that are familiar to us in finite form have captivating infinite analogues. Here are several examples.

The meaning carried by decimal fractions is quite clear. For example, 0.33 denotes $\frac{33}{100}$. And what is meant by 0.333...*?

The sum of a finite number of addends is also readily understandable. For example, $1 + \frac{1}{2} + \frac{1}{4} = \frac{7}{4}$. What about $1 + \frac{1}{2} + \frac{1}{4} + \dots$?

There are finite polynomials, such as $1 + 2x + 3x^2$. But is it permissible to operate with “polynomials having an infinite number of terms”, such as $1 + x + x^2 + \dots + x^n + \dots$?

In spite of the apparent similarity, the finite and infinite are separated by a deep and wide gulf. Mathematicians managed to overlook this gulf until the 19th century. Ignoring the danger, they treated infinite objects as they treated finite objects, and sometimes obtained absurd results. In the 19th century the way to deal with the infinite was gradually found, and reliable bridges were erected across the separating gulf. We shall walk across by one of these bridges.

* The ellipsis is a mathematical symbol with two meanings. An ellipsis within a formula (e.g. $1 + x + x^2 + \dots + x^n$) denotes a certain number of omitted terms; an ellipsis at the end (e.g. $1 + x + x^2 + \dots$) stands for “and so on to infinity”. Ellipses can also replace whole rows.

Note that a terminating decimal fraction is in no way distinct from an ordinary fraction; the only distinction is the notation. The fraction 0.33 has the numerator 33 and the denominator 100. But what is the numerator of the nonterminating fraction 0.333...? We do not know the answer to this question, which is a clear indication that a nonterminating decimal fraction does not have the meaning carried by a terminating one. Academician N.N. Luzin used to say that drawing a symbol 0.333... does not impart a meaning to this symbol. It remains but a pattern. However, we can give this symbol a meaning.

The sum $1 + \frac{1}{2} + \frac{1}{4}$ has a meaning because we can calculate it by successive additions: $1 + \frac{1}{2} = \frac{3}{2}$, $\frac{3}{2} + \frac{1}{4} = \frac{7}{4}$. But we could not determine the infinite sum $1 + \frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \dots$ by this method because the process of consecutive additions will never terminate. And this is not a technicality but an obstacle of principle. It would be frivolous to hide behind the argument that successive additions of terms in $1 + \frac{1}{2} + \frac{1}{4} + \frac{1}{8} + \dots$ give us approximate values of the infinite sum. What does not exist, cannot be looked for. The meaning of infinite sums must first be defined, and only after that can we speak about the approximate values of the sum.

This is what we shall begin with now. Recall that we were going to cross the gulf between the finite and the infinite by one bridge (out of many). The name of this bridge is the Principle of Nested Segments or Cantor's Continuity Axiom*.

4.1.2. Principle of Nested Segments. We often say that the number line is continuous. Mathematicians always have to seek logically impeccable statements that replace intuitive notions. The principle of nested segments is the axiom that expresses precisely the property of the number line that we call its continuity.

* Georg Cantor (1845-1918), the great German mathematician, created set theory. Set theory became the foundation of the whole of mathematics.

Recall that a segment is defined as a set of points of a number line, consisting of two distinct points a and b (called the ends of the segment) and all points between a and b. A segment is denoted by [a, b]. A set including all points between a and b but not the points a and b themselves is called the interval and is denoted by (a, b). The interval (a, b) contains two points less than the segment [a, b], but in some cases this difference is extremely important. If one of the end points is added to the set of points between a and b, the result is a half-closed interval. The same letter can be used to denote a point of the number line and the number corresponding to it. Then

segment [a, b]: $a \le x \le b$;
interval (a, b): $a < x < b$;
half-closed interval [a, b): $a \le x < b$;
half-closed interval (a, b]: $a < x \le b$.
Let us consider on the number line a sequence of segments

$$[a_1, b_1],\ [a_2, b_2],\ \dots,\ [a_n, b_n],\ \dots$$

having two properties: (1) each segment (beginning with the second one) is nested in the preceding segment, and (2) the length of the segments tends to zero (as $n \to \infty$).

The first property signifies that all points of the nth segment belong to the (n - 1)th segment (Fig. 10).

Fig. 10

The second property means that for an arbitrary fixed length $\varepsilon$ there exists a number n such that the length of the segment $[a_n, b_n]$ is smaller than $\varepsilon$ (and the segments with greater numbers are obviously even shorter).

In this case there exists a unique point that belongs to all segments.

Let us give a compact reformulation of the axiom.

Cantor’s continuity axiom. If an infinite sequence of segments is given on a straight line, such that (1) each next segment is nested within the preceding one, and (2) the length of the segments tends to zero, then there exists a unique point belonging to all these segments.

Now we shall give a more detailed explanation of this axiom. Figure 10 shows the first several segments of our sequence. We define the nth step as the transition from the nth to the (n + 1)th segment. Each step eliminates some of the points. For example, point A in Fig. 10 belongs to the first segment but not to the second one. Hence, this point will be eliminated at the first step of the process. Point B will survive the first step but gets eliminated at the second step. Point C survives the first two steps, but goes out on the third step, and so on. Each point of the segment $[a_1, b_1]$ has its own fate. Some fall inside the 1000th segment but stay outside of the 1001st. These points survive 1000 steps of the process but get eliminated at the 1001st step.

The principle of nested segments implies that there exists the point X which will never be eliminated, that is, it survives each step; in other words, it belongs to any segment, regardless of its number.

The axiom states that such a point exists. As for the uniqueness of this point, it was introduced into the same formulation for the sake of convenience and can be readily proved. Indeed, let us assume that two such points exist, X and Y. We denote by d the distance between them. We have stipulated that the length of segments of the sequence tends to zero. Let us find a number n such that the length of segment $[a_n, b_n]$ is less than d,

$$b_n - a_n < d.$$

Then the segment $[a_n, b_n]$ cannot cover the segment XY = d, that is, points X and Y cannot both belong to segment $[a_n, b_n]$ (and those following it). We have therefore proved that there cannot exist two points belonging to all segments.

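Example 1. Let us consider the following segments on the number line

$$[0, 1],\ \left[\frac{1}{4}, \frac{3}{4}\right],\ \left[\frac{3}{8}, \frac{5}{8}\right],\ \left[\frac{7}{16}, \frac{9}{16}\right],\ \dots,\ \left[\frac{1}{2} - \frac{1}{2^n}, \frac{1}{2} + \frac{1}{2^n}\right],\ \dots$$

Obviously, point $\frac{1}{2}$ and only it belongs to all these segments.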

Example 2. Let a sequence of segments be given

$$[0, 1],\ \left[0, \frac{1}{2}\right],\ \left[0, \frac{1}{3}\right],\ \dots,\ \left[0, \frac{1}{n}\right],\ \dots$$

Point 0 and only it belongs to all these segments.
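
Both examples can be explored with exact rational arithmetic. The sketch below is an illustration, not part of the original text: it assumes that the nth segments are $[\frac{1}{2} - \frac{1}{2^n}, \frac{1}{2} + \frac{1}{2^n}]$ in Example 1 and $[0, \frac{1}{n}]$ in Example 2, and the function names are hypothetical. A finite check of this kind cannot prove anything about the infinite sequence; it only shows the two conditions of the axiom at work.

```python
from fractions import Fraction


def example_1(n):
    """nth segment of Example 1: centre 1/2, half-length 1/2**n (assumed form)."""
    half = Fraction(1, 2 ** n)
    return (Fraction(1, 2) - half, Fraction(1, 2) + half)


def example_2(n):
    """nth segment of Example 2: [0, 1/n] (assumed form)."""
    return (Fraction(0), Fraction(1, n))


def is_nested(segments):
    """True if every segment lies inside the preceding one."""
    return all(a1 <= a2 and b2 <= b1
               for (a1, b1), (a2, b2) in zip(segments, segments[1:]))


first = [example_1(n) for n in range(1, 11)]
second = [example_2(n) for n in range(1, 11)]

print(is_nested(first), is_nested(second))
print(first[-1][1] - first[-1][0], second[-1][1] - second[-1][0])
```

The second printed line shows the lengths of the tenth segments, which have already become small.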
In each of these examples we deal with a sequence of nested segments. We have easily singled out the unique point common to the segments. The principle of nested segments states that such a point always exists, no matter how the sequence was generated, provided that the two conditions are met.

Note. If we considered in Example 2 a sequence of intervals

$$(0, 1),\ \left(0, \frac{1}{2}\right),\ \left(0, \frac{1}{3}\right),\ \dots,\ \left(0, \frac{1}{n}\right),\ \dots$$

this sequence would not contain a point common to all intervals even though they are nested and their length tends to zero. Indeed, point 0 does not belong to any of them, while any other point of the interval (0, 1) will be left out at some step.

It is thus essential that Cantor’s axiom be applied to segments. A similar statement would not hold for intervals.

The principle of nested segments expresses the continuity of a number line: segments converge to a point of the line, not to a “hole”. Let us break the continuity of the line by piercing it at point $\frac{1}{2}$. To be precise, let us remove point $\frac{1}{2}$ from the number line. The remaining set of points, M, cannot be called a line. This is an assemblage of two so-called open rays, i.e. rays without vertices, $\left(-\infty, \frac{1}{2}\right)$ and $\left(\frac{1}{2}, \infty\right)$. Let us follow Example 1 and consider a sequence of segments. Now these are not segments of a line because they lack one point, but segments on a set M. Each of them contains two ends and all the points of set M between them. Although these are nested segments and their length tends to zero, no point of set M can be found that belongs to all of them. The principle of nested segments does not hold in M.

4.1.3. The Set of Rational Numbers. Let us follow the process of gradual filling in of the number line with numbers. First we mark the integers. The set of all integers is traditionally denoted by Z. No subtle arguments are needed to show that the points of set Z do not fill up the number line completely. Each two integers are separated by a “solid” mass of points (an interval) that so far remain nameless.

Our next step is to mark rational numbers. It will be sufficient to mark all rational numbers within the interval between 0 and 1. All rational points on the number line will then be obtained by displacing the segment [0, 1] leftward and rightward an integral number of times.
We shall mark rational numbers on segment [0, 1] as follows:

Step 1. Mark off fractions with denominator 2. There is only one such fraction: $\frac{1}{2}$.

Step 2. Mark off fractions with denominator 3, arranging them in the order of increasing numerators: $\frac{1}{3}$, $\frac{2}{3}$.

Step 3. Mark off fractions with denominator 4, arranging them in ascending order: $\frac{1}{4}$, $\left(\frac{2}{4}\right)$, $\frac{3}{4}$. The fraction $\frac{2}{4}$ is written in parentheses because this number has appeared earlier.

...

Step (n - 1). Mark off fractions with denominator n, arranging them in ascending order: $\frac{1}{n}$, $\frac{2}{n}$, $\frac{3}{n}$, ..., $\frac{n-1}{n}$. If fractions reducible to lower terms are encountered among those in this sequence, they can be crossed out.

...

This process is infinite. Although we cannot complete it, we can be sure that it will mark all rational numbers in the interval between 0 and 1. Indeed, can there exist a fraction whose turn will never come? Let us select an arbitrary fraction from this interval, say, $\frac{37}{89}$. Obviously, at some step of marking fractions with denominators 2, 3, 4, ... (namely, at the 88th step) we shall reach denominator 89. Then arranging the fractions in ascending order, $\frac{1}{89}$, $\frac{2}{89}$, $\frac{3}{89}$, and so forth, we inevitably reach $\frac{37}{89}$. Therefore, whatever fraction between 0 and 1 is selected, it will certainly be reached and marked off on the segment [0, 1]. Let us suppose that the process has been completed. This means that all rational points, i.e. those representing rational numbers, have been marked on the segment [0, 1]. By displacing these points by 1, 2, 3, ... units leftward and rightward we shall have all rational numbers of the number line marked. In what follows we always denote the set of all rational numbers by Q.
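
The marking process can be imitated in Python. The sketch below is an illustration only; the function names `step_fractions` and `first_marked` are hypothetical. It finds the step at which a given fraction between 0 and 1 is marked for the first time.

```python
from fractions import Fraction


def step_fractions(step):
    """Pairs (numerator, denominator) written down at the given step.

    Step number `step` deals with the denominator step + 1.
    """
    n = step + 1
    return [(k, n) for k in range(1, n)]


def first_marked(target):
    """Step at which the number `target` (0 < target < 1) is first marked."""
    step = 1
    while True:
        if any(Fraction(k, n) == target for k, n in step_fractions(step)):
            return step
        step += 1


print(first_marked(Fraction(37, 89)))  # 88
print(first_marked(Fraction(2, 4)))    # 1
```

The second call shows why $\frac{2}{4}$ is put in parentheses at Step 3: the same number was already marked at Step 1 as $\frac{1}{2}$.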

4.1.4. The Existence of Nonrational Points on the Number 
Line. Is the number line completely filled up by the points 
of the set Q? No, it is not. Some points of the line do not be¬ 
long to Q; they are not rational. However, this is not as obvi¬ 
ous as for the set Z, and subtle arguments are needed to clarify 
this case. Pythagoras is reputed to have made the following 
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great discovery: there exists no number* whose square equals 
2. An equivalent formulation is as follows: the diagonal 
of a square is incommensurate with its side. If all rational 
points are marked on the number line (Fig. 11), the arc of a 
circle whose radius is the diagonal of square OA passes freely 
through the numerical axis without intersecting the set Q. 

Nevertheless, the set Q is everywhere dense on the number
line. This means that any segment of the number line, however
short, contains rational points. Consequently, even though
rational points do not exhaust the number line, this line
does not contain segments that would be absolutely free of
rational points. This can be proved easily if the reader recalls
how we went about marking the rational points on the segment [0, 1].

Fig. 11

We shall consider sequences of nested segments

$\sigma_1 \supset \sigma_2 \supset \ldots \supset \sigma_n \supset \ldots$

on the set Q (i.e. the ends of the segments are rational points).
The principle of nested segments does not hold on the set Q.
Even if the conditions of Cantor’s continuity axiom are met,
there may not exist a point in Q that belongs to all these segments.
We shall immediately see that this fact can be used to
invent numbers of a new type: irrational numbers.

4.1.5. Nonterminating Decimal Fractions. Let us attribute
the following meaning to the symbol of a nonterminating decimal:
a nonterminating decimal is a sequence of nested segments
on the set Q. By terminating this fraction after each
decimal place, we obtain the left ends of the segments. Adding
unity to the last decimal place we find the right ends of the
segments. For example, the fraction 0.313131... denotes the
following sequence of nested segments on the set Q:

[0.3; 0.4], [0.31; 0.32], [0.313; 0.314], ....


* No rational number, in fact. The qualification is not made 
because so far we do not know any other numbers. 
Each step diminishes the length of these segments by a
factor of 10, and hence, the length tends to zero, regardless of
the fraction chosen.
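The passage from a decimal to its nested segments can be sketched as follows. The helper `decimal_segments` is a hypothetical name; it assumes a decimal of the form 0.d1d2d3... given by a string of digits.

```python
from fractions import Fraction

def decimal_segments(digits: str, count: int):
    """Nested segments for the decimal 0.d1 d2 d3 ... (digits after the point)."""
    segments = []
    for n in range(1, count + 1):
        # Left end: the decimal terminated after n places.
        left = Fraction(int(digits[:n]), 10 ** n)
        # Right end: unity added to the last decimal place.
        right = left + Fraction(1, 10 ** n)
        segments.append((left, right))
    return segments

segs = decimal_segments("313131", 3)
for left, right in segs:
    print(float(left), float(right), "length:", right - left)
```

Each printed length is one tenth of the preceding one, which is the reason why the lengths tend to zero.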

Now we shall consider two examples that are superficially 
alike but in fact are profoundly different. 

Example 1. A nonterminating periodical decimal 0.333...
stands for the following sequence of nested segments:

[0.3; 0.4], [0.33; 0.34], [0.333; 0.334], . . .

Is there a point belonging to all these segments? Here and below
we mean a point in the set Q. No doubt, there is such a
point on the number line.

This point is $x = \frac{1}{3}$.

The following inequalities hold:

$0.3 < \frac{1}{3} < 0.4$;
$0.33 < \frac{1}{3} < 0.34$;          (15)
$0.333 < \frac{1}{3} < 0.334$;
. . .

For this reason the number $\frac{1}{3}$ is said to be the value of the
nonterminating decimal 0.333... The formal definition of this
concept will be given below, but first we shall discuss another
example.

Example 2. Let us form two sequences: (a) the greatest
decimal with 0, 1, 2, . . ., n, . . . decimal places whose
square is less than 2; and (b) the least decimal with
0, 1, 2, . . ., n, . . . decimal places whose square is greater than 2.
We successively find

$1^2 < 2$ but $2^2 > 2$;
$1.4^2 < 2$ but $1.5^2 > 2$;
$1.41^2 < 2$ but $1.42^2 > 2$;
. . .

This process can be continued indefinitely. Is there a point
in Q that belongs to all segments

[1; 2], [1.4; 1.5], [1.41; 1.42], . . . ?
In other words, does there exist a rational number x that
satisfies each of the following inequalities:

$1 \le x \le 2$,
$1.4 \le x \le 1.5$,          (16)
$1.41 \le x \le 1.42$,
. . .

[Note that we use the sign $\le$ (not $<$) because we are looking
for a point belonging to the segment, that is, a point that
may coincide with one of the ends. As it happens, the number
in Example 1 always lies strictly within a segment. If we began with
a fraction 0.2000... corresponding to the number $\frac{1}{5}$, we would
have to use the sign $\le$.]

It is well known that there is no rational number that
satisfies all the inequalities (16). This means that any rational
number would violate inequalities (16), beginning with
some line. It also means that the corresponding nonterminating
decimal fraction 1.4142135... determined by the process
described above is meaningless.
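The statement that every rational number violates inequalities (16) from some line onward can be tried out numerically. The sketch below builds the segments with exact integer arithmetic; the names `sqrt2_segment` and `first_violated_line` are ours, and the lines are counted from 0, so that line 0 is the segment [1; 2].

```python
from fractions import Fraction
from math import isqrt

def sqrt2_segment(n: int):
    """Segment with n decimal places from Example 2."""
    # Greatest integer k with k*k < 2 * 10^(2n); the product is never a perfect square.
    k = isqrt(2 * 10 ** (2 * n))
    return Fraction(k, 10 ** n), Fraction(k + 1, 10 ** n)

def first_violated_line(x: Fraction, limit: int = 50):
    """Index of the first line of (16) that x fails, or None if none is found up to `limit`."""
    for n in range(limit):
        left, right = sqrt2_segment(n)
        if not (left <= x <= right):
            return n
    return None

print(sqrt2_segment(2))                        # the segment [1.41; 1.42]
print(first_violated_line(Fraction(99, 70)))   # 5
```

The fraction 99/70 = 1.4142857... satisfies the lines with 0 to 4 decimal places and fails the line [1.41421; 1.41422].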

Now is the right moment to give it the meaning it lacked. 

4.1.6. Irrational Numbers. The notation $\frac{1}{3}$ can be interpreted
in two ways: (a) as a fraction $\frac{1}{3}$, that is, as a ratio of
two natural numbers 1 and 3, or (b) as a nonterminating
decimal 0.333..., that is, as the common point of nested segments

[0.3; 0.4], [0.33; 0.34], [0.333; 0.334], .... 

The first interpretation is inapplicable to the number x 
that we seek with inequalities (16). However, this number 
corresponds to a nonterminating decimal, that is, a system 
of nested segments 

[1; 2], [1.4; 1.5], [1.41; 1.42], .... 

We can agree on a convention that this nonterminating deci¬ 
mal, or, what is the same, this system of nested segments, 
defines a number. This is a number of a new type: it cannot be 
presented as a ratio of natural numbers. Such numbers are 
called irrational. 

Let us additionally clarify the idea of introducing irratio¬ 
nal numbers. 
The infinite sequence of nested segments (15) defines a number.
It happens to be a rational number: $\frac{1}{3}$. We can operate
with it ignoring the sequence (15).

The infinite sequence of nested segments (16) also defines
a number but the type of this number is not familiar (we assume
that only rational numbers were known) and the number
appeared only as the sequence (16).

4.1.7. Real Numbers. The name for rational and irrational 
numbers is real numbers. In other words, the set of real num¬ 
bers, R, is the union of the sets of rational and irrational 
numbers. 

When the concept of number is extended and generalized, the 
old numbers should be treated as particular cases of a broader 
concept instead of being opposed to the new numbers. In 
other words, there must be a universal principle of formation 
and a universal notation for all real numbers. 

The universal notation, also constituting the universal meth¬ 
od of formation is that adopted for nonterminating decimals. 

Some rational numbers can be presented as terminating dec¬ 
imals. However, we shall agree, in order to have a universal 
notation for all real numbers, to convert any decimal to a 
nonterminating decimal. This can be done in two ways, for 
example, 

0.5 = 0.5000..., 

0.5 = 0.4999.... 

In order to represent each real number by a nonterminating 
decimal in a unique manner, we agree on the following: 

Convention. It is forbidden to use nonterminating decimals 
with the numeral 9 for the period. 

Now the number 0.5 can be written as a nonterminating dec¬ 
imal in a unique way: 0.5000... . 

By virtue of this convention, each real number is written as 
a nonterminating decimal fraction in a unique manner, that 
is, no two distinct nonterminating decimals can represent the 
same real number. 

Let us emphasize that our definition actually identifies a real
number as a nonterminating decimal fraction. Some real
numbers can be written in other ways. For example, rational
numbers are representable by common fractions. Roots of
natural numbers are denoted by $\sqrt{2}$, $\sqrt{3}$, ..., $\sqrt[3]{2}$, . . . .
And finally, some numbers have been labelled by individual
(“personal”) symbols: $\pi$, $e$, and some others. However,
nonterminating decimals give a universal method of forming and
presenting any real number.

This method of introduction of real numbers does not create 
the set R “from thin air”. It assumes that a certain subset of 
R, namely, the set of all terminating decimals, already exists. 
The method allows us to supplement this set to R by using 
nested segments whose ends are given by terminating deci¬ 
mals. Real numbers can be defined differently, if we start not 
with the set of terminating decimals but with some other set 
which is everywhere dense on the number line. 

The reader would be mistaken to think that we have already 
constructed the theory of real numbers. The definition of real 
numbers given above is only a first step. Many more steps 
would be needed to construct the theory, namely, ordering 
real numbers (i.e. finding a method of comparing the magni¬ 
tudes), defining operations with real numbers (addition, mul¬ 
tiplication, etc.), to name a few. However, we are not going to 
further elaborate this aspect. The purpose of this subsection 
is to clarify the principle of nested segments which we shall 
use to interpret nonterminating continued fractions. 

4.1.8. Representing Real Numbers on the Number Line. 
Let a positive real number

$x = a_0.\alpha_1\alpha_2\alpha_3\ldots$          (17)

be given.

This is the decimal notation. Here $a_0$ is an arbitrary nonnegative
integer, and the remaining $\alpha_i$ are numerals from 0
to 9. A terminating decimal fraction

$\underline{x}_n = a_0.\alpha_1\alpha_2\ldots\alpha_n$,

which is obtained if numerals beginning with $\alpha_{n+1}$ in (17)
are dropped, is called the approximate value of x with n decimal
places by defect. If we add one unity in the last decimal
place,

$\overline{x}_n = a_0.\alpha_1\alpha_2\ldots\alpha_n + \frac{1}{10^n}$,

we obtain the approximate value of x with n decimal places by
excess. If $\alpha_n = 9$, this unity can change the numerals preceding
$\alpha_n$. For example, for $x = \frac{892}{900} = 0.99111\ldots$, we have
$\underline{x}_2 = 0.99$, $\overline{x}_2 = 1.00$.

Leaving aside the logical foundation, we can write the
following evident inequalities:

$\underline{x}_n \le x < \overline{x}_n$.
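For a rational x the two approximate values can be computed exactly. This is a small sketch for positive x only; the function names `by_defect` and `by_excess` are our own labels for the two notions defined above.

```python
from fractions import Fraction
from math import floor

def by_defect(x: Fraction, n: int) -> Fraction:
    """Approximate value of a positive x with n decimal places by defect."""
    # Dropping all numerals after the n-th place is the same as taking
    # the integral part of x * 10^n.
    return Fraction(floor(x * 10 ** n), 10 ** n)

def by_excess(x: Fraction, n: int) -> Fraction:
    """Approximate value of a positive x with n decimal places by excess."""
    return by_defect(x, n) + Fraction(1, 10 ** n)

x = Fraction(892, 900)  # 0.99111...
print(by_defect(x, 2), by_excess(x, 2))  # 99/100 1
```

The second value shows the carry mentioned in the text: adding one unity to the last place of 0.99 changes the preceding numerals and gives 1.00.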
Question to the reader. Why is the sign in the left-hand inequality
$\le$ while that on the right is $<$? Can they be reversed
after a certain modification of the preceding definitions?

The following fact is of high importance: if all real num¬ 
bers are marked on the number line, the line is completely filled 
up. We shall give now a better formulation of this statement 
and prove it. 

Theorem 1. Each real number corresponds to a unique point
of the number line.

► Let us take a positive real number $x = a_0.\alpha_1\alpha_2\alpha_3\ldots$
This number must belong to an infinite sequence of nested
segments

$[\underline{x}_0, \overline{x}_0]$, $[\underline{x}_1, \overline{x}_1]$, $[\underline{x}_2, \overline{x}_2]$, ... .

The lengths of these segments form a geometric progression
with common ratio $\frac{1}{10}$. By virtue of Cantor’s continuity axiom,
the number line contains a unique point belonging to all
these segments. This is the point that corresponds to the number
x. ■

Theorem 2. Each point of the number line corresponds to a
unique real number.

► Let a point x be given on the number line (x lying somewhere
on the right half-axis). If x is an integer, no more proof
is needed. Otherwise, x lies between two neighbouring integers
$a_0$ and $a_0 + 1$. Let us begin the decimal notation of
the number x with its integral part:

$x = a_0.\ldots$

We divide the segment $[a_0;\ a_0 + 1]$ into ten equal parts.
If x does not coincide with any one of the division points,
it will be found between $a_0.\alpha_1$ and $a_0.\alpha_1 + 0.1$. We extend
the decimal presentation of x:

$x = a_0.\alpha_1\ldots$

and divide the segment $[a_0.\alpha_1;\ a_0.\alpha_1 + 0.1]$ into ten equal
parts.

If at some step of this process point x coincides with one of
the division points, then

$x = a_0.\alpha_1\alpha_2\ldots\alpha_n000\ldots$

If coincidence never occurs, then

$x = a_0.\alpha_1\alpha_2\ldots\alpha_n\ldots$

and x lies strictly inside all segments $[\underline{x}_n, \overline{x}_n]$
(n = 0, 1, 2, . . .). ■

Corollary. The set R obeys the principle of nested segments. 

4.1.9. The Condition of Rationality of Nonterminating
Decimals. We know from the course of high school mathematics
that each rational number can be expressed by a periodical
decimal (pure or mixed). For example,

$\frac{1}{3} = 0.333\ldots$;  $\frac{13}{90} = 0.1444\ldots$;  $\frac{1}{5} = 0.2000\ldots$

Conversely, each periodical decimal expresses a rational number.
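The periodicity of the decimal of a rational number is a consequence of long division: only finitely many remainders are possible, so one of them must repeat. A minimal sketch follows; the function `decimal_expansion` is a hypothetical helper for nonnegative p and positive q.

```python
def decimal_expansion(p: int, q: int):
    """Return (integral part, pre-period, period) of the decimal of p/q."""
    integer_part, remainder = divmod(p, q)
    digits = []
    seen = {}  # remainder -> index of the digit it produced
    while remainder not in seen:
        seen[remainder] = len(digits)
        digit, remainder = divmod(remainder * 10, q)
        digits.append(str(digit))
    start = seen[remainder]  # the period begins where the remainder first occurred
    return integer_part, "".join(digits[:start]), "".join(digits[start:])

print(decimal_expansion(1, 3))    # (0, '', '3')
print(decimal_expansion(13, 90))  # (0, '1', '4')
print(decimal_expansion(1, 5))    # (0, '2', '0')
```

A terminating decimal such as 0.2 is reported with period 0, in agreement with the notation 0.2000... used above.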

It then follows that each irrational number is expressed by
a nonperiodical nonterminating decimal. For example, by
using the algorithm of extracting square roots we can find
any desired number of numerals in the decimal representing
$\sqrt{2}$:

$\sqrt{2} = 1.4142135\ldots$

We can always find one more numeral of the sequence. And
though we do not know the formal law for the generation of
this sequence of numerals (i.e. cannot find a function $\varphi(n)$
giving the nth decimal place), we are sure that this fraction
is not periodical.

Conversely, each nonperiodical decimal expresses an irrational
number. For example, let us take a fraction

0.1010010001...,

where the number of noughts between two consecutive unities
each time increases by one. This fraction is nonperiodical
and, therefore, it stands for an irrational number. In this
example the formal law for the sequence of numerals is quite
simple; if $u_n$ is the nth decimal numeral, then

$u_n = \begin{cases} 1 & \text{if } n \text{ is a number of the type } \frac{k(k+1)}{2}, \\ 0 & \text{otherwise.} \end{cases}$
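In the fraction 0.1010010001... the unities occupy the places 1, 3, 6, 10, ..., that is, the triangular numbers. The digit law can be sketched as below; the test through 8n + 1 is a standard criterion for triangular numbers that we bring in ourselves, and the name `u` merely mirrors the notation of the text.

```python
from math import isqrt

def u(n: int) -> int:
    """n-th decimal numeral (n >= 1) of 0.1010010001..."""
    # n = k(k+1)/2 for some natural k exactly when 8n + 1 is a perfect square.
    root = isqrt(8 * n + 1)
    return 1 if root * root == 8 * n + 1 else 0

print("0." + "".join(str(u(n)) for n in range(1, 16)))  # 0.101001000100001
```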

4.2. Nonterminating Continued Fractions 

4.2.1. Numerical Value of a Nonterminating Continued
Fraction. By terminating the nonterminating continued
fraction $[a_0; a_1, a_2, a_3, \ldots]$ after each successive term, we
generate the consecutive convergents:

$\frac{p_0}{q_0} = [a_0]$,  $\frac{p_1}{q_1} = [a_0; a_1]$,  ...,  $\frac{p_n}{q_n} = [a_0; a_1, a_2, \ldots, a_n]$,  ...

And although a nonterminating continued fraction is only
a symbol to which no numerical value is assigned, convergents
are rational numbers. They define an infinite sequence of
nested segments

$\left[\frac{p_0}{q_0}; \frac{p_1}{q_1}\right]$, $\left[\frac{p_1}{q_1}; \frac{p_2}{q_2}\right]$, $\left[\frac{p_2}{q_2}; \frac{p_3}{q_3}\right]$, ..., $\left[\frac{p_{n-1}}{q_{n-1}}; \frac{p_n}{q_n}\right]$, ...          (18)


We have already mentioned that the denominators of convergents
strictly increase (see formulas (6)):

$q_0 < q_1 < q_2 < q_3 < \ldots$

All $q_n$ being natural numbers, these inequalities mean that
$q_n$ grow indefinitely:

$\lim_{n \to \infty} q_n = \infty$.

But formula (13) then yields

$\lim_{n \to \infty} \Delta_n = \lim_{n \to \infty} \left( \frac{p_{n+1}}{q_{n+1}} - \frac{p_n}{q_n} \right) = 0$.

The difference between neighbouring convergents tends to zero.

It is always assumed in such statements that $n \to \infty$.

Each segment (18) is nested within the preceding one (see
Fig. 8). By virtue of Cantor’s continuity axiom, there exists
a unique point of the number line or, in other words, a unique
real number, that belongs to all these segments. It is this
number that we define as the value of the nonterminating
continued fraction.
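The shrinking of the segments between neighbouring convergents can be observed on a concrete fraction. The sketch below uses the usual recurrence for numerators and denominators of convergents, with the conventional starting values p = 1, q = 0 before the first term (an assumption of ours about notation); the function name `convergents` is hypothetical.

```python
from fractions import Fraction

def convergents(terms):
    """Convergents p_n/q_n of the continued fraction [a_0; a_1, a_2, ...]."""
    p_prev, q_prev = 1, 0        # conventional values preceding p_0, q_0
    p, q = terms[0], 1           # p_0 / q_0 = a_0
    result = [Fraction(p, q)]
    for a in terms[1:]:
        p, p_prev = a * p + p_prev, p
        q, q_prev = a * q + q_prev, q
        result.append(Fraction(p, q))
    return result

# First terms of [1; 2, 2, 2, ...]
conv = convergents([1, 2, 2, 2, 2])
print(conv)  # the fractions 1, 3/2, 7/5, 17/12, 41/29
print([abs(b - a) for a, b in zip(conv, conv[1:])])  # 1/2, 1/10, 1/60, 1/348
```

The differences decrease rapidly, so the segments between neighbouring convergents contract to a single point.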

This definition implies the following corollaries: 

1. The value of a nonterminating continued fraction is between
any two neighbouring convergents.

2. All even-numbered convergents are approximate values by
defect, and all odd-numbered convergents are approximate values
by excess, of the nonterminating continued fraction.

Let us have a look at Fig. 9 once again. If the continued
fraction is nonterminating, the broken line does not have the
last segment whose end lies on the dashed horizontal line.
This broken line has an infinite number of vertices that alternately
fall above and below the dashed line; $\alpha$ is the value
of the nonterminating continued fraction.

3. The sequence of even-numbered convergents

$\frac{p_0}{q_0}, \frac{p_2}{q_2}, \frac{p_4}{q_4}, \ldots, \frac{p_{2n}}{q_{2n}}, \ldots$

is monotone increasing and tends to $\alpha$ from the left. The
sequence of odd-numbered convergents

$\frac{p_1}{q_1}, \frac{p_3}{q_3}, \frac{p_5}{q_5}, \ldots, \frac{p_{2n+1}}{q_{2n+1}}, \ldots$

is monotone decreasing and tends to $\alpha$ from the right.

Let us look carefully at the sequence of segments (18). The 
ends of the segments are rational numbers. In our notation 
the first entry is alternately the left and the right end. This 
is obviously of no principal importance. 

4.2.2. Representation of Irrationals by Nonterminating
Continued Fractions. We have already learned the algorithm
of expanding a real number into a continued fraction. Assume
now that this algorithm is applied to an irrational $\alpha$; what we
obtain is a nonterminating continued fraction
$[a_0; a_1, a_2, a_3, \ldots]$. Earlier this continued fraction was said
to correspond to the number $\alpha$:

$\alpha \sim [a_0; a_1, a_2, a_3, \ldots]$.

Now we know how to determine the value of nonterminating
continued fractions. A natural question then arises: is
it $\alpha$ or another number that gives the numerical value of
$[a_0; a_1, a_2, a_3, \ldots]$? In other words, is the correspondence
between $\alpha$ and $[a_0; a_1, a_2, a_3, \ldots]$ symmetrical?

Yes, it is. 

Indeed, the convergents obtained in the process of expanding
$\alpha$ into a continued fraction are alternately smaller and
greater than $\alpha$. For example, consider the first two steps of
the process:

$\alpha = a_0 + \frac{1}{x_1}$,

whence

$a_0 < \alpha$.

Further,

$\frac{1}{\alpha - a_0} = a_1 + \frac{1}{x_2}$,

whence

$a_1 < \frac{1}{\alpha - a_0}$,  $\alpha - a_0 < \frac{1}{a_1}$,  $\alpha < a_0 + \frac{1}{a_1}$.

As a result,

$a_0 < \alpha < a_0 + \frac{1}{a_1}$

or, in a different form,

$\frac{p_0}{q_0} < \alpha < \frac{p_1}{q_1}$.

This chain of arguments can be prolonged further, that is,
$\alpha$ is between any two neighbouring convergents.

If we wish to reverse the arguments and find the value of 
the obtained nonterminating continued fraction, we need to 
recall that by definition this value is the common point of 
all segments (18), that is, of all segments between neighbour¬ 
ing convergents. 

However, there exists only one point that belongs to all
segments (18). Therefore, the number $\alpha$ and the value of the
continued fraction $[a_0; a_1, a_2, a_3, \ldots]$ coincide. We are thus
entitled to replace the symbol $\sim$ with the equality sign =:

$\alpha = [a_0; a_1, a_2, a_3, \ldots]$.
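For an irrational of the form $\sqrt{D}$ the expansion and the position of the convergents can be checked in exact integer arithmetic. The sketch below does not follow the general algorithm of the booklet literally: it substitutes the standard integer recurrence for square roots, in which the current complete quotient is kept in the form $(\sqrt{D} + m)/d$. The names `sqrt_continued_fraction` and `sides_of_convergents` are ours.

```python
from math import isqrt

def sqrt_continued_fraction(D: int, count: int):
    """First `count` terms of the continued fraction of sqrt(D)."""
    a0 = isqrt(D)
    if a0 * a0 == D:
        raise ValueError("D must not be an exact square")
    terms = [a0]
    m, d, a = 0, 1, a0           # complete quotient is (sqrt(D) + m) / d
    while len(terms) < count:
        m = d * a - m
        d = (D - m * m) // d     # the division is always exact
        a = (a0 + m) // d
        terms.append(a)
    return terms

def sides_of_convergents(D: int, count: int):
    """For each convergent p/q report whether it lies below or above sqrt(D)."""
    terms = sqrt_continued_fraction(D, count)
    p_prev, q_prev, p, q = 1, 0, terms[0], 1
    sides = ["below" if p * p < D * q * q else "above"]
    for a in terms[1:]:
        p, p_prev = a * p + p_prev, p
        q, q_prev = a * q + q_prev, q
        # p/q < sqrt(D) is equivalent to p^2 < D q^2 for positive p and q.
        sides.append("below" if p * p < D * q * q else "above")
    return sides

print(sqrt_continued_fraction(2, 5))  # [1, 2, 2, 2, 2]
print(sides_of_convergents(2, 4))     # ['below', 'above', 'below', 'above']
```

The alternation of "below" and "above" is the numerical counterpart of the statement that the number lies between any two neighbouring convergents.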


4.2.3. The Single-Valuedness of the Representation of a
Real Number by a Continued Fraction. Do continued fractions
offer a universal means of representing real numbers? In other
words, is it true that any real number* can be represented by
a continued fraction, and in a unique manner?

The first part of the question has already been answered. 
Each real number can indeed be expanded into a continued 
fraction. A rational number expands into a terminating, and 
an irrational number to a nonterminating continued fraction. 
But the aspect of single-valuedness has not yet been analysed. 

Let us think over the following example:

$\cfrac{1}{6 + \cfrac{1}{4}} = \cfrac{1}{6 + \cfrac{1}{3 + \cfrac{1}{1}}}$

or, in contracted notation,

[0; 6, 4] = [0; 6, 3, 1].


* For the sake of simplicity, the arguments assume the numbers 
to be positive. It is clear, nevertheless, that the answer to the for¬ 
mulated question cannot change when considering negative numbers. 
This transformation (the splitting-off of unity from the
last term) can be effected in any fraction whose last term is
distinct from unity. And if the last term is unity, it can be
added to the last-but-one term (i.e. we can read the last
example from right to left).

It can be readily proved that this is the only reason for the
non-single-valuedness of the representation of a (positive)
rational number by a continued fraction. We shall eliminate
this cause by introducing the following convention: The last
term of a continued fraction must not be unity. From now on
it is mandatory for us to choose the first of two notations
[0; 6, 4] and [0; 6, 3, 1] for the same number.
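Both notations can be evaluated exactly, and the convention can be expressed as a small normalizing step. This is a sketch with hypothetical helper names `value` and `canonical`; a continued fraction is represented by the plain list of its terms.

```python
from fractions import Fraction

def value(terms):
    """Value of the terminating continued fraction [a_0; a_1, ..., a_n]."""
    result = Fraction(terms[-1])
    for a in reversed(terms[:-1]):
        result = a + 1 / result
    return result

def canonical(terms):
    """Apply the convention: the last term must not be unity."""
    if len(terms) > 1 and terms[-1] == 1:
        # Add the final unity to the last-but-one term.
        return terms[:-2] + [terms[-2] + 1]
    return list(terms)

print(value([0, 6, 4]), value([0, 6, 3, 1]))  # 4/25 4/25
print(canonical([0, 6, 3, 1]))                # [0, 6, 4]
```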

Now we are ready to prove that two continued fractions
$[a_0; a_1, a_2, \ldots]$ and $[b_0; b_1, b_2, \ldots]$ (terminating or
nonterminating) are equal if and only if, first, they have identical
numbers of terms and, second, their respective terms coincide,
that is, $a_0 = b_0$, $a_1 = b_1$, etc.

The condition “they have identical numbers of terms” must 
be understood as follows: either both fractions are terminat¬ 
ing and have identical numbers of terms, or they are both 
nonterminating. 

► Let us denote by $\alpha$ the value of two equal continued
fractions (we do not know whether each of them is terminating
or nonterminating):

$\alpha = [a_0; a_1, a_2, \ldots] = [b_0; b_1, b_2, \ldots]$.

The term $a_0$ (as well as $b_0$) equals $E(\alpha)$* and thus is
determined by $\alpha$ in a single-valued manner. Consequently,

$a_0 = b_0$.

Subtract $a_0$ from $\alpha$:

$\alpha - a_0 = [0; a_1, a_2, \ldots] = [0; b_1, b_2, \ldots]$

and consider its reciprocal value

$\frac{1}{\alpha - a_0} = [a_1; a_2, \ldots] = [b_1; b_2, \ldots]$.


* The definition of the function $E(\alpha)$ is: “The greatest integer
not greater than $\alpha$.” For instance, $E(2.5) = 2$, $E(1) = 1$,
$E(-2.5) = -3$. The symbol $E(\alpha)$ reads “the integral part of $\alpha$”;
the letter E comes from the French word “entier” (integral).
The term $a_1$ (and $b_1$) is $E\left(\frac{1}{\alpha - a_0}\right)$ and hence, is determined
in a single-valued manner by the value of $\frac{1}{\alpha - a_0}$. Therefore,

$a_1 = b_1$

and so forth. By repeating these arguments we shall prove
that $a_2 = b_2$, $a_3 = b_3$, etc.

Can two equal continued fractions have unequal numbers of
terms? Let us assume that the first continued fraction is a
terminating one with s terms, while the second fraction is
either a terminating one with t terms, t > s, or it is
nonterminating. This means


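$a_0 + \cfrac{1}{a_1 + \cdots + \cfrac{1}{a_s}} = a_0 + \cfrac{1}{a_1 + \cdots + \cfrac{1}{a_s + \cfrac{1}{a_{s+1} + \cdots}}}$

or

$a_s = a_s + \cfrac{1}{a_{s+1} + \cdots}$,

whence

$\cfrac{1}{a_{s+1} + \cdots} = 0$,

which is impossible. Hence, t = s.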

We conclude that each real number is expressed by a con¬ 
tinued fraction, in a unique manner. ■ 

In this proof we have employed a rule that is often useful.
When the expansion of $\alpha$ into a (terminating or nonterminating)
continued fraction is known, the following should be
done in order to find the expansion of $\frac{1}{\alpha}$:

(1) move the whole “comb” one step to the right, if $a_0 \ne 0$,
and write in a nought instead of the integer, or

(2) if $a_0 = 0$, move the whole “comb” one step to the left.

Examples.

$\alpha = [3; 1, 2, 5]$,  $\frac{1}{\alpha} = [0; 3, 1, 2, 5]$;

$\beta = [0; 2, 2, 2, \ldots]$,  $\frac{1}{\beta} = [2; 2, 2, \ldots]$.
The proof follows from a careful examination of the following
continued fractions:

$\alpha = a_0 + \cfrac{1}{a_1 + \cfrac{1}{a_2 + \cdots}} = [a_0; a_1, a_2, \ldots]$,

$\frac{1}{\alpha} = \cfrac{1}{a_0 + \cfrac{1}{a_1 + \cdots}} = [0; a_0, a_1, \ldots]$.
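The “comb” rule for the reciprocal amounts to a shift of the list of terms. The following sketch assumes a positive number given by a terminating list of terms (with at least two terms when the first one is zero); `reciprocal` and `value` are hypothetical helper names.

```python
from fractions import Fraction

def value(terms):
    """Value of the terminating continued fraction [a_0; a_1, ..., a_n]."""
    result = Fraction(terms[-1])
    for a in reversed(terms[:-1]):
        result = a + 1 / result
    return result

def reciprocal(terms):
    """Terms of 1/alpha for a positive alpha = [a_0; a_1, ...]."""
    if terms[0] != 0:
        # Rule (1): shift the comb to the right and write a nought in front.
        return [0] + list(terms)
    # Rule (2): a_0 = 0, shift the comb to the left.
    return list(terms[1:])

alpha = [3, 1, 2, 5]
print(value(alpha), reciprocal(alpha), value(reciprocal(alpha)))
# 59/16 [0, 3, 1, 2, 5] 16/59
```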


4.3. The Nature of Numbers Given by 
Continued Fractions 

4.3.1. Classification of Irrationals. We already know the
following important fact: each rational number is given by a
terminating continued fraction, and each irrational number by a
nonterminating continued fraction.

We shall not add anything here on rational numbers. But 
irrationals may be of very different types. Let us get 
acquainted with their classification. 

The equation

$a_0x^n + a_1x^{n-1} + \ldots + a_{n-1}x + a_n = 0$,          (19)

where $a_0 \ne 0$, is called the algebraic equation of degree n.
We shall consider only the case when all coefficients in equation
(19) are rational or even integral. These two cases are
identical. If the coefficients are fractional, we can multiply
both sides of the equation by the common denominator of
these fractional coefficients, thus obtaining an equation with
integral coefficients equivalent to the initial equation.

In our discussion of equation (19) we assume hereafter that
its coefficients are integers (positive, negative, or zeros). An
additional constraint is imposed on the coefficient of the
leading term: $a_0 \ne 0$.

A real number is called an algebraic number of degree n if 
this number is a root of an algebraic equation of degree n with 
integral coefficients but is not a root of any other algebraic 
equation of lower degree with integral coefficients. 
Example 1. Each rational number $\frac{p}{q}$ is an algebraic number
of the first degree because it is the root of equation

$qx - p = 0$.

Example 2. The number $\sqrt{2}$ is an algebraic number of the
second degree because it is a root of the equation

$x^2 - 2 = 0$.

We know that $\sqrt{2}$ cannot be a root of any equation of the
first degree with integral coefficients because such an equation
($a_0x + a_1 = 0$) has a rational number $x = -\frac{a_1}{a_0}$ for the
root.

Algebraic numbers of the second degree are called quadratic
irrationalities.

It was discovered that there exist nonalgebraic numbers.
These numbers are called transcendental numbers.

Here is their definition: A real number $\alpha$ is said to be
transcendental if it is not a root of any algebraic equation with integral
coefficients.

To discover a transcendental number is not an easy task.
If we want to prove that a number $\alpha$ is algebraic, it is sufficient
to find an algebraic equation with integral coefficients
for which $\alpha$ is a root. But if we cannot find this equation, we
cannot conclude that $\alpha$ is transcendental, for we have to prove
that no such equation exists. This problem was solved for
the first time by the French mathematician Joseph Liouville
in 1844. He proved the transcendency of some specific real
numbers. In 1882 the German mathematician Ferdinand Lindemann
proved that $\pi$ is a transcendental number. At present
very many examples of transcendental numbers are known.
For example, decimal logarithms of all positive rational numbers,
with the exception of numbers of the type $10^n$, are
transcendental.

The reader must be warned against a misunderstanding. The 
fact that examples of transcendental numbers are difficult to 
find does not mean at all that they are rare. Quite the opposite! 
Georg Cantor showed that in a certain sense (it would not be 
possible to explain what this means in this booklet) almost 
all real numbers are transcendental, that is, algebraic num¬ 
bers are rare exceptions. However, the nature of algebraic 
numbers is simpler, and therefore we can give numerous 
examples of them. As for transcendental numbers, it is always
very hard to prove the transcendence in each specific case*.

4.3.2. Quadratic Irrationals. We know from the preceding 
subsection that a quadratic irrational is an irrational number 
which is a root of a quadratic equation with integral coefficients. 

The word “irrational” replaces the phrase found in the pre¬ 
ceding definition: “and it is not a root of any algebraic equa¬ 
tion of lower degree with integral coefficients”. In the case in 
question this means “is not the root of any equation of first 
degree with integral coefficients”, that is, is not a rational 
number. 

Let us consider a quadratic equation

$a_0x^2 + a_1x + a_2 = 0$,

where $a_0$, $a_1$, $a_2$ are integers and $a_0 \ne 0$. Its roots are given
by the formula

$x = \frac{-a_1 \pm \sqrt{a_1^2 - 4a_0a_2}}{2a_0}$.

These roots are quadratic irrationals if the following necessary
and sufficient conditions are satisfied:

(1) the discriminant $D = a_1^2 - 4a_0a_2$ must be non-negative.
If D < 0, the roots would not be real;

(2) the discriminant D must not be an exact square. If
$D = N^2$, the roots would be rational.

These conditions enable us to give a different definition of
the quadratic irrational: a quadratic irrational is a number of
a type $p + q\sqrt{D}$ where p and q are rational numbers ($q \ne 0$) and D
is a natural number which is not an exact square.
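The two conditions on the discriminant translate directly into a test on the integer coefficients. A minimal sketch, with a hypothetical function name `classify_roots`:

```python
from math import isqrt

def classify_roots(a0: int, a1: int, a2: int) -> str:
    """Nature of the roots of a0*x^2 + a1*x + a2 = 0 with integer coefficients."""
    if a0 == 0:
        raise ValueError("the leading coefficient must be nonzero")
    D = a1 * a1 - 4 * a0 * a2
    if D < 0:
        return "not real"                 # condition (1) fails
    root = isqrt(D)
    if root * root == D:
        return "rational"                 # condition (2) fails
    return "quadratic irrational"

print(classify_roots(1, -2, -1))  # quadratic irrational (D = 8)
print(classify_roots(1, -3, 2))   # rational (D = 1)
print(classify_roots(1, 0, 1))    # not real (D = -4)
```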

Before analysing some examples of quadratic irrationals, let 
us prove in advance four useful lemmas. But first we shall 
introduce some notations and define some terms that will 
help us to avoid repetitions. 

Lower-case Roman letters p, q, ... will always denote ra- 
tionals (positive, negative, or zero). In particular cases they 
may happen to be integers. 

Capital Roman letters D, M, N, . . . will denote natural 
numbers not equal to exact squares: 2, 3, 5, 6, 7, 8, 10, . . . 


* Actually, a simple method is known for constructing continued
fractions whose values are transcendental numbers. However, if one
faces the problem of proving the transcendence of a number that has
been defined by different means ($\pi$, log 2, sin 1, etc.), this always
constitutes a very difficult problem.
Two square-root radicals* $\sqrt{M}$ and $\sqrt{N}$ are said to be
similar if $\sqrt{N} = p\sqrt{M}$. Otherwise, that is, if the ratio
$\frac{\sqrt{N}}{\sqrt{M}}$ is not rational, the radicals $\sqrt{M}$ and $\sqrt{N}$ are not similar.
For example, $\sqrt{2}$ and $\sqrt{8}$ are similar but $\sqrt{2}$ and $\sqrt{10}$ are
not.

If $\sqrt{M}$ and $\sqrt{N}$ are nonsimilar radicals, then $\sqrt{MN}$ is also
a square-root radical (i.e. MN is not an exact square) nonsimilar
to either of the initial two. This is clear from the identities

$\sqrt{MN} = N\,\frac{\sqrt{M}}{\sqrt{N}}$ (not a rational number),  $\frac{\sqrt{MN}}{\sqrt{M}} = \sqrt{N}$,  $\frac{\sqrt{MN}}{\sqrt{N}} = \sqrt{M}$.

Lemma 1. If $\sqrt{M}$ and $\sqrt{N}$ are nonsimilar radicals, then
the equality

$k + l\sqrt{M} + m\sqrt{N} = 0$          (20)

holds only if k = l = m = 0.

In a more concise notation,

$k + l\sqrt{M} + m\sqrt{N} = 0 \iff k = l = m = 0$.

► Two cases must be considered in the proof: (1) $l \ne 0$
and $m \ne 0$ (for any k), and (2) one of the coefficients l, m is
nonzero while the other vanishes.

In the first case we transpose k to the right-hand side and
raise both sides of the equality to the second power. After
some manipulations we obtain

$2lm\sqrt{MN} = k^2 - l^2M - m^2N$,

that is, $\sqrt{MN}$ is a rational, which is incorrect. Hence, the
first case cannot take place.

In the second case we see from equality (20) that $\sqrt{M}$ or
$\sqrt{N}$ is a rational, in contradiction to the imposed condition.
Hence, the second case is also rejected.

We thus have to recognize that l = m = 0. Equality (20)
then shows that k = 0 as well. ■


* This is an example of familiar casual usage: “square-root radical”
means not only the symbol $\sqrt{\ }$ that stands for the operation of
extracting the square root, but also refers to any number of the type
$\sqrt{2}$, $\sqrt{3}$, etc.
Lemma 2. If $\sqrt{M}$ and $\sqrt{N}$ are nonsimilar radicals, then the
equality

$k + l\sqrt{M} + m\sqrt{N} + n\sqrt{MN} = 0$          (21)

is possible only if k = l = m = n = 0.

In a shorter form

$k + l\sqrt{M} + m\sqrt{N} + n\sqrt{MN} = 0 \iff k = l = m = n = 0$.

► Assume that $l \ne 0$, $m \ne 0$, and $n \ne 0$. We transform
equality (21) to the following form:

$l\sqrt{M} + m\sqrt{N} = -k - n\sqrt{MN}$.

Let us square both sides of this equality. Simple transformations
then give

$2(lm - kn)\sqrt{MN} = k^2 + n^2MN - l^2M - m^2N$,

that is, $\sqrt{MN}$ is a rational number, which is incorrect. We
thus have to reject the assumption $l \ne 0$, $m \ne 0$, $n \ne 0$, that
is, we have to assume that at least one of the coefficients l, m, n
vanishes. But in this case equality (21) reduces to (20) so that
by virtue of Lemma 1 all the remaining coefficients vanish. ■
Lemma 3. If $p + q\sqrt{M}$ is a root of the equation

$a_0x^n + a_1x^{n-1} + \ldots + a_{n-1}x + a_n = 0$

with integral coefficients, then $p - q\sqrt{M}$ is also a root of this
equation.

► Given:

$a_0(p + q\sqrt{M})^n + a_1(p + q\sqrt{M})^{n-1} + \ldots + a_{n-1}(p + q\sqrt{M}) + a_n = 0$.          (22)

To be proved:

$a_0(p - q\sqrt{M})^n + a_1(p - q\sqrt{M})^{n-1} + \ldots + a_{n-1}(p - q\sqrt{M}) + a_n = 0$.          (23)

Let us remove the parentheses in (22). The terms containing
$(q\sqrt{M})^{\alpha}$ obtained thereby are subsumed under two types:

(1) $\alpha$ is even (including $\alpha = 0$). All these terms are rational.
We denote their sum by k.

(2) $\alpha$ is odd. All these terms are of the form $s\sqrt{M}$. We
denote their sum by $l\sqrt{M}$.

Equality (22) thus transforms to

$k + l\sqrt{M} = 0$.          (24)

Let us make similar transformations over equality (23)
which is obtained from (22) by substituting $-q\sqrt{M}$ for $q\sqrt{M}$.
This substitution does not affect the terms containing $q\sqrt{M}$
to even powers, while the terms containing $q\sqrt{M}$ to odd
powers only have their sign reversed. Equality (23) will thus
become

$k - l\sqrt{M} = 0$.          (25)

Equality (24) can only hold if k = l = 0.

Indeed, if $l \ne 0$, equality (24) indicates that $\sqrt{M}$ is a
rational. And if l = 0, then k = 0 as well.

But if k = l = 0, then (25) also holds. ■

Let us outline once again the idea of the proof. Equalities
(24) and (25) are the equalities (22) and (23) appropriately
transformed. Equality (24) yields k = l = 0, and k = l = 0
implies that (25) holds.

Lemma 4. If $p + q\sqrt{M} + r\sqrt{N}$, where $\sqrt{M}$ and $\sqrt{N}$ are
nonsimilar radicals, is a root of an equation with integral
coefficients, then the numbers $p \pm q\sqrt{M} \pm r\sqrt{N}$, regardless of the
combination of signs, are also the roots of this equation.

To contract the notations we denote

$P(x) = a_0x^n + a_1x^{n-1} + \ldots + a_{n-1}x + a_n$.

Given:

$P(p + q\sqrt{M} + r\sqrt{N}) = a_0(p + q\sqrt{M} + r\sqrt{N})^n + a_1(p + q\sqrt{M} + r\sqrt{N})^{n-1} + \ldots + a_{n-1}(p + q\sqrt{M} + r\sqrt{N}) + a_n = 0$.          (26)

To be proved:

$P(p \pm q\sqrt{M} \pm r\sqrt{N}) = 0$.

► Now we remove the parentheses in (26). All the terms
obtained thereby will be of the form

$A\,p^{\alpha}(q\sqrt{M})^{\beta}(r\sqrt{N})^{\gamma}$,

where A are coefficients and $\alpha$, $\beta$, $\gamma$ are non-negative integral
exponents. We subsume these terms under four types:

Type    $\beta$    $\gamma$    Kind of the term
1       Even       Even        Rational
2       Even       Odd         $t\sqrt{N}$
3       Odd        Even        $u\sqrt{M}$
4       Odd        Odd         $v\sqrt{MN}$

When parentheses in (26) are removed, we obtain

$P(p + q\sqrt{M} + r\sqrt{N}) = k + l\sqrt{M} + m\sqrt{N} + n\sqrt{MN} = 0$.

If we substitute $-q\sqrt{M}$ for $q\sqrt{M}$, it will not change the
type-1 and type-2 terms, while type-3 and type-4 terms will
only have their signs reversed. Therefore, if

$P(p + q\sqrt{M} + r\sqrt{N}) = k + l\sqrt{M} + m\sqrt{N} + n\sqrt{MN}$,

then

$P(p - q\sqrt{M} + r\sqrt{N}) = k - l\sqrt{M} + m\sqrt{N} - n\sqrt{MN}$.

By going through similar steps with the other combinations
of signs, we find: if

$P(p + q\sqrt{M} + r\sqrt{N}) = k + l\sqrt{M} + m\sqrt{N} + n\sqrt{MN}$,

then

$P(p + q\sqrt{M} - r\sqrt{N}) = k + l\sqrt{M} - m\sqrt{N} - n\sqrt{MN}$;
$P(p - q\sqrt{M} + r\sqrt{N}) = k - l\sqrt{M} + m\sqrt{N} - n\sqrt{MN}$;
$P(p - q\sqrt{M} - r\sqrt{N}) = k - l\sqrt{M} - m\sqrt{N} + n\sqrt{MN}$;

if $P(p + q\sqrt{M} + r\sqrt{N}) = 0$, then, by virtue of Lemma 2,
k = l = m = n = 0. But it implies that all the other values
of $P(p \pm q\sqrt{M} \pm r\sqrt{N})$ vanish. ■

Let us have a look at examples. 

Example 1. The number $1 + \sqrt{2}$ is a quadratic irrational. How can an equation be found that generates this irrational? Lemma 3 states that the number $1 - \sqrt{2}$ is also a root of this equation. Hence, the equation is

$$(x - 1 - \sqrt{2})(x - 1 + \sqrt{2}) = 0$$

or

$$x^2 - 2x - 1 = 0.$$

Example 2. The number $\sqrt{2} + \sqrt{3}$ is not a quadratic irrational. The equation with integral coefficients that generates it has, by virtue of Lemma 4, the following roots:

$$x_1 = \sqrt{2} + \sqrt{3}; \quad x_2 = \sqrt{2} - \sqrt{3}; \quad x_3 = -\sqrt{2} + \sqrt{3}; \quad x_4 = -\sqrt{2} - \sqrt{3}.$$

This equation is, therefore,

$$(x - \sqrt{2} - \sqrt{3})(x - \sqrt{2} + \sqrt{3})(x + \sqrt{2} - \sqrt{3})(x + \sqrt{2} + \sqrt{3}) = 0,$$

or

$$x^4 - 10x^2 + 1 = 0.$$

Note 1. If the roots are known, the equation can obviously be found also by Vieta’s formulas. For the normalized equation of fourth degree

$$x^4 + p_1 x^3 + p_2 x^2 + p_3 x + p_4 = 0,$$

Vieta’s formulas are

$$p_1 = -(x_1 + x_2 + x_3 + x_4);$$

$$p_2 = x_1 x_2 + x_1 x_3 + x_1 x_4 + x_2 x_3 + x_2 x_4 + x_3 x_4;$$

$$p_3 = -(x_2 x_3 x_4 + x_1 x_3 x_4 + x_1 x_2 x_4 + x_1 x_2 x_3);$$

$$p_4 = x_1 x_2 x_3 x_4.$$
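As a cross-check of Example 2, Vieta's formulas can be evaluated numerically for the four roots $\pm\sqrt{2} \pm \sqrt{3}$. The following Python sketch is only an illustration (the function name `vieta_coefficients` is ours, not the booklet's), and it works in floating-point arithmetic, so the integral coefficients are recovered only after rounding.

```python
import math
from itertools import combinations

# The four roots of Example 2: every sign combination of sqrt(2) and sqrt(3).
roots = [s2 * math.sqrt(2) + s3 * math.sqrt(3) for s2 in (1, -1) for s3 in (1, -1)]

def vieta_coefficients(xs):
    """Coefficients [p1, ..., pn] of the monic polynomial x^n + p1*x^(n-1) + ... + pn."""
    coeffs = []
    for k in range(1, len(xs) + 1):
        # k-th elementary symmetric sum of the roots, taken with alternating sign
        e_k = sum(math.prod(c) for c in combinations(xs, k))
        coeffs.append((-1) ** k * e_k)
    return coeffs

# Round away floating-point noise; adding 0.0 turns a possible -0.0 into 0.0.
coefficients = [round(c, 9) + 0.0 for c in vieta_coefficients(roots)]
print(coefficients)
```

Up to rounding, the four numbers printed should be the coefficients of $x^3$, $x^2$, $x$ and the free term in $x^4 - 10x^2 + 1 = 0$.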

Note 2. The reader may be somewhat surprised on trying to check whether the equation indeed corresponds to the prescribed roots. In fact, the equation gives:

$$x = \pm\sqrt{5 \pm 2\sqrt{6}}.$$

At first glance, this differs from the given roots $\pm\sqrt{2} \pm \sqrt{3}$. But in fact $\sqrt{3} \pm \sqrt{2} = \sqrt{5 \pm 2\sqrt{6}}$. This can be checked either by squaring both sides of the equality or by employing the so-called formula for transforming complex radicals:

$$\sqrt{A \pm \sqrt{B}} = \sqrt{\frac{A + \sqrt{A^2 - B}}{2}} \pm \sqrt{\frac{A - \sqrt{A^2 - B}}{2}}. \qquad (27)$$

Formula (27) helps only when $A^2 - B$ is an exact square. This is precisely the case in our example: $5 \pm 2\sqrt{6} = 5 \pm \sqrt{24}$, so that $A = 5$, $B = 24$ and $A^2 - B = 1$.
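Formula (27) is easy to test numerically. The sketch below (Python, floating-point; the helper name `denest` is hypothetical) evaluates the right-hand side of (27) for $A = 5$, $B = 24$, where $A^2 - B = 1$, and compares it with $\sqrt{5 \pm 2\sqrt{6}}$ and with $\sqrt{3} \pm \sqrt{2}$.

```python
import math

def denest(A, B, sign=+1):
    """Right-hand side of formula (27) for sqrt(A ± sqrt(B)); requires A*A - B >= 0."""
    root = math.sqrt(A * A - B)
    return math.sqrt((A + root) / 2) + sign * math.sqrt((A - root) / 2)

# 5 ± 2*sqrt(6) = 5 ± sqrt(24), hence A = 5, B = 24.
lhs_plus = math.sqrt(5 + math.sqrt(24))
lhs_minus = math.sqrt(5 - math.sqrt(24))

print(lhs_plus, denest(5, 24, +1), math.sqrt(3) + math.sqrt(2))
print(lhs_minus, denest(5, 24, -1), math.sqrt(3) - math.sqrt(2))
```

The three numbers on each printed line should agree to within rounding error.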

4.3.3. Euler’s Theorem. A nonterminating continued fraction 
is said to be periodical if its terms form a periodical sequence. 
For example, such are the fractions 

[0; 1, 1, 1, . . .]; 

[2; 1, 5, 1, 5, 1, 5, . . .]; 

[0; 1, 2, 3, 5, 3, 5, 3, 5, . . .].

The first two fractions are purely periodical, and the third is a mixed periodical continued fraction. In this classification we ignore the integral component $a_0$. What follows is a more formal definition.

A nonterminating continued fraction is said to be periodical if there exist natural numbers $N$ and $k$ such that

$$a_{n+k} = a_n$$

for any $n \geq N$.

The following theorem, proved by Leonhard Euler in 1737, holds for continued fractions.

Theorem. The value of each periodical continued fraction 
is a quadratic irrational. 

► Let us consider two examples.

Example 1. $[0; 1, 1, 1, \ldots]$. We have

$$\alpha = \cfrac{1}{1 + \cfrac{1}{1 + \cfrac{1}{1 + \ldots}}}.$$

Let us apply to this equality an operation of “rewinding”, consisting of (1) taking the reciprocal of each side, and (2) subtracting the integral part (entier) from each side. In this particular case these steps are only made once:

$$\frac{1}{\alpha} = 1 + \cfrac{1}{1 + \cfrac{1}{1 + \ldots}}.$$

What remains on the right after the integral part is subtracted is the initial fraction, that is, $\alpha$:

$$\frac{1}{\alpha} - 1 = \alpha,$$

which gives us a quadratic equation for $\alpha$:

$$\alpha^2 + \alpha - 1 = 0,$$

whence $\alpha = \frac{\sqrt{5} - 1}{2}$ (of course, the negative root must be rejected).

This analysis shows that any fraction of the type $[0; a, a, a, \ldots]$ represents a quadratic irrational.

And what if the period consists not of one number but of $k$ numbers? Then $k$ pairs of steps will be made in “rewinding”.

Example 2. $[0; 1, 2, 1, 2, \ldots]$.

$$\cfrac{1}{\cfrac{1}{\alpha} - 1} - 2 = \alpha,$$

or

$$\alpha^2 + 2\alpha - 2 = 0,$$

whence

$$\alpha = -1 + \sqrt{3}.$$

Note that the root cannot happen to be rational whatever 
the period because the initial continued fraction is nonter¬ 
minating. 
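The two examples can be checked numerically by unrolling the period many times. The Python sketch below is an illustration only: `periodic_cf_value` is a hypothetical helper that approximates $[0; \text{period}, \text{period}, \ldots]$ in floating-point arithmetic, and the depth of 40 repetitions is an arbitrary choice.

```python
import math

def periodic_cf_value(period, depth=40):
    """Approximate [0; period, period, ...] by unrolling the period `depth` times."""
    x = 0.0
    for _ in range(depth):
        # Build the fraction from the innermost term outwards.
        for a in reversed(period):
            x = 1.0 / (a + x)
    return x

example_1 = periodic_cf_value([1])       # [0; 1, 1, 1, ...]
example_2 = periodic_cf_value([1, 2])    # [0; 1, 2, 1, 2, ...]

print(example_1, (math.sqrt(5) - 1) / 2)
print(example_2, -1 + math.sqrt(3))
```

Each printed pair should agree to many decimal places, in accordance with the quadratic equations obtained by “rewinding”.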

But what happens if $a_0$ is nonzero? In this case we transpose $a_0$ to the left-hand side, and begin “rewinding”. ■

However, this method is cumbersome if the period is long. For this reason we shall give another proof, not as lucid as the one above but shorter.

► Let a nonterminating continued fraction $\alpha = [0; a_1, a_2, \ldots]$ be a purely periodical one, with the period length equal to $k$. Then $\alpha = \alpha_{k+1}$ (recall that $\alpha_{k+1}$ is the $(k+1)$th complete quotient):

$$\alpha = [0; a_1, a_2, \ldots, a_k, a_1, a_2, \ldots].$$

Formula (9) yields

$$\alpha = \frac{p_k \alpha_{k+1} + p_{k-1}}{q_k \alpha_{k+1} + q_{k-1}}.$$

Hence,

$$\alpha = \frac{p_k \alpha + p_{k-1}}{q_k \alpha + q_{k-1}},$$

that is, $\alpha$ satisfies a quadratic equation

$$q_k x^2 + (q_{k-1} - p_k) x - p_{k-1} = 0. \qquad (28)$$

The roots of this equation have opposite signs, and $\alpha$ is the positive root.

If the fraction is a mixed periodical one,

$$\alpha = [a_0; a_1, \ldots, a_N, \underbrace{a_{N+1}, \ldots, a_{N+k}}_{\text{period}}, \ldots],$$

we must first “rewind” from right to left the first part of the fraction up to the term $a_N$, inclusive, and then apply the proof as given above. ■
Note. The number $\alpha$ is irrational because it is represented by a nonterminating continued fraction. Consequently, the discriminant of equation (28) must not be an exact square. This statement can be tested by a direct evaluation:

$$D = (p_k - q_{k-1})^2 + 4p_{k-1}q_k = p_k^2 - 2p_k q_{k-1} + q_{k-1}^2 + 4p_{k-1}q_k = \ldots$$

By adding and subtracting the term $4p_k q_{k-1}$ we find:

$$\ldots = p_k^2 + 2p_k q_{k-1} + q_{k-1}^2 - 4p_k q_{k-1} + 4p_{k-1}q_k = (p_k + q_{k-1})^2 - 4q_{k-1}q_k\left(\frac{p_k}{q_k} - \frac{p_{k-1}}{q_{k-1}}\right) = \ldots$$

At this juncture we apply formula (13):

$$\ldots = (p_k + q_{k-1})^2 + 4\cdot(-1)^k.$$

Finally, we have

$$D = (p_k + q_{k-1})^2 + 4\cdot(-1)^k$$

or

$$D - (p_k + q_{k-1})^2 = \pm 4.$$

We see that $D$ is not an exact square. The difference between squares of natural numbers cannot equal 4. If the set of natural numbers is supplemented with zero, there will be found a unique pair of squares spaced by 4: 0 and 4.

4.3.4. Lagrange Theorem. As you could see in the preceding subsection, Euler’s theorem is proved quite easily. The inverse theorem is considerably more difficult to prove. It was proved by Lagrange in 1770.

Lagrange Theorem. Each quadratic irrational is represented by a periodic continued fraction.

Lagrange succeeded in proving the theorem in a very complicated manner. Quite a few mathematicians attempted to simplify the proof but only returned to Lagrange’s original idea. A hundred-odd years later the French mathematician Charves suggested a simpler proof based on a different idea. First we shall outline Charves’ idea, and then give the detailed proof.

Let $\alpha$ be a quadratic irrational. Let us expand it into a continued fraction interrupting the process at each step, beginning with the second step:

$$\alpha = [a_0; a_1 \mid \alpha_2] = [a_0; a_1, a_2 \mid \alpha_3] = \ldots = [a_0; a_1, a_2, \ldots, a_{n-1} \mid \alpha_n] = \ldots$$

Here $\alpha_2, \alpha_3, \ldots, \alpha_n, \ldots$ are complete quotients. We were able to see in Subsection 3.1.5 that if some complete quotient happens to repeat, that is, if we find that $\alpha_n = \alpha_{n+k}$, the continued fraction will be periodical.

We shall prove, first, that each complete quotient satisfies a quadratic equation with integral coefficients:

$$A_n \alpha_n^2 + B_n \alpha_n + C_n = 0. \qquad (29)$$

Of course, equation (29) can vary for different values of $\alpha_n$, and for this reason the coefficients $A$, $B$, $C$ are equipped with subscripts. Rather, we should say: each $\alpha_n$ satisfies its own quadratic equation with integral coefficients.

Second, we shall prove that the magnitudes of the coefficients in (29) are bounded:*

$$|A_n| < L; \qquad |B_n| < M; \qquad |C_n| < N. \qquad (30)$$

* We assume that a quadratic equation with integral coefficients is written in the form irreducible to lower terms. Otherwise this statement would be meaningless.
Attention, please! Here is Charves’ clever idea. The bounds $L$, $M$, $N$ are independent of $n$ (they depend exclusively on $\alpha$). Since $A_n$, $B_n$, $C_n$ are integers, only a finite number of admissible values exist for them. Consequently, for each given $\alpha$ the number of possible equations (29), and hence the number of possible roots of these equations, is finite. Obviously, the sequence of complete quotients $\alpha_2, \alpha_3, \ldots, \alpha_n, \ldots$ will inevitably start to repeat itself, which is what has to be proved.

► Now let us implement this plan. First we prove (29), and then (30).

The quadratic irrational $\alpha$ satisfies a certain quadratic equation with integral coefficients:

$$A\alpha^2 + B\alpha + C = 0. \qquad (31)$$

By virtue of (9),

$$\alpha = \frac{p_{n-1}\alpha_n + p_{n-2}}{q_{n-1}\alpha_n + q_{n-2}}. \qquad (32)$$

We substitute (32) into (31) and eliminate the denominator:

$$A(p_{n-1}\alpha_n + p_{n-2})^2 + B(p_{n-1}\alpha_n + p_{n-2})(q_{n-1}\alpha_n + q_{n-2}) + C(q_{n-1}\alpha_n + q_{n-2})^2 = 0$$

or

$$A_n\alpha_n^2 + B_n\alpha_n + C_n = 0,$$

where

$$A_n = Ap_{n-1}^2 + Bp_{n-1}q_{n-1} + Cq_{n-1}^2;$$

$$B_n = 2Ap_{n-1}p_{n-2} + B(p_{n-1}q_{n-2} + p_{n-2}q_{n-1}) + 2Cq_{n-1}q_{n-2}; \qquad (33)$$

$$C_n = Ap_{n-2}^2 + Bp_{n-2}q_{n-2} + Cq_{n-2}^2.$$

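It remains for us to prove that coefficients (33) have bounded magnitudes. From (14),

$$\left|\alpha - \frac{p_{n-1}}{q_{n-1}}\right| < \frac{1}{q_{n-1}^2}.$$

This can be rewritten in the form

$$\frac{p_{n-1}}{q_{n-1}} - \alpha = \frac{\delta}{q_{n-1}^2},$$

where $-1 < \delta < 1$, whence

$$p_{n-1} = \alpha q_{n-1} + \frac{\delta}{q_{n-1}} \qquad (-1 < \delta < 1).$$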


Let us substitute this expression for $p_{n-1}$ into the first formula of (33):

$$A_n = A\left(\alpha q_{n-1} + \frac{\delta}{q_{n-1}}\right)^2 + B\left(\alpha q_{n-1} + \frac{\delta}{q_{n-1}}\right)q_{n-1} + Cq_{n-1}^2 = q_{n-1}^2(A\alpha^2 + B\alpha + C) + 2A\alpha\delta + B\delta + \frac{A\delta^2}{q_{n-1}^2} = \ldots$$

Obviously, the expression in parentheses vanishes owing to (31):

$$\ldots = \left(2A\alpha + B + \frac{A\delta}{q_{n-1}^2}\right)\delta.$$

But $|\delta| < 1$, so that

$$|A_n| < \left|2A\alpha + B + \frac{A\delta}{q_{n-1}^2}\right|.$$

We shall also take into account that $q_{n-1}^2 \geq 1$ ($q_0 = 1$, and the sequence $q_n$ is increasing). The inequality is only strengthened if we drop $q_{n-1}^2$ (i.e. replace it by unity):

$$|A_n| < \left|2A\alpha + B + \frac{A\delta}{q_{n-1}^2}\right| \leq |2A\alpha| + |B| + |A| \cdot |\delta| < |2A\alpha| + |B| + |A|.$$

Our goal is attained: we have indicated for $|A_n|$ a bound independent of $n$.

Instead of conducting similar manipulations for $|C_n|$, we note that $C_n$ is obtained from $A_n$ by replacing $n$ by $n - 1$, that is, $C_n = A_{n-1}$. The bound established above is independent of $n$, and thus is valid for $C_n$. As for $B_n$, a detour will be more effective. Let us calculate the discriminant of equation (29) by using formula (33). We omit here a long but dull series of manipulations* and give the final result:

$$B_n^2 - 4A_nC_n = (p_{n-1}q_{n-2} - p_{n-2}q_{n-1})^2 (B^2 - 4AC).$$

But formula (12) states that

$$p_{n-1}q_{n-2} - p_{n-2}q_{n-1} = (-1)^{n-2}.$$

Hence,

$$B_n^2 - 4A_nC_n = B^2 - 4AC. \qquad (34)$$

This formula expresses a natural fact: when a quadratic irrational $\alpha$ is expanded into a continued fraction, the complete quotients are quadratic irrationals of the same nature as $\alpha$ is. This nature is determined by the discriminant. They are all of the type

$$\alpha_n = r_n + s_n\sqrt{D}$$

for the same $D$.

Now we conclude from (34) that

$$B_n^2 = B^2 - 4AC + 4A_nC_n.$$

All terms in the right-hand side being bounded, $B_n^2$ and with it $|B_n|$ are bounded as well. ■
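Formulas (33) and (34) lend themselves to a quick experiment. The Python sketch below takes $\alpha = \sqrt{d}$, so that $A = 1$, $B = 0$, $C = -d$ in (31), computes the triples $(A_n, B_n, C_n)$ from (33) and records them. Two things here are our assumptions rather than part of the text: the partial quotients of $\sqrt{d}$ are produced by the standard integer recurrence for square roots, and the starting values $p_{-1} = 1$, $q_{-1} = 0$ are the usual convention for convergents. The function names are hypothetical.

```python
from math import isqrt

def sqrt_partial_quotients(d, count):
    """First `count` partial quotients of sqrt(d) for a non-square integer d > 0."""
    a0 = isqrt(d)
    m, den, a = 0, 1, a0
    terms = [a0]
    while len(terms) < count:
        # Standard exact recurrence for the complete quotients (m + sqrt(d)) / den.
        m = den * a - m
        den = (d - m * m) // den
        a = (a0 + m) // den
        terms.append(a)
    return terms

def charves_coefficients(d, count):
    """Triples (A_n, B_n, C_n) of formulas (33) for alpha = sqrt(d), n = 1..count."""
    A, B, C = 1, 0, -d                      # alpha satisfies x^2 - d = 0
    terms = sqrt_partial_quotients(d, count)
    p2, q2 = 1, 0                           # p_{n-2}, q_{n-2}, starting at index -1
    p1, q1 = terms[0], 1                    # p_{n-1}, q_{n-1}, starting at index 0
    triples = []
    for n in range(1, count + 1):
        An = A * p1 * p1 + B * p1 * q1 + C * q1 * q1
        Bn = 2 * A * p1 * p2 + B * (p1 * q2 + p2 * q1) + 2 * C * q1 * q2
        Cn = A * p2 * p2 + B * p2 * q2 + C * q2 * q2
        triples.append((An, Bn, Cn))
        if n < count:
            a = terms[n]
            p1, p2 = a * p1 + p2, p1
            q1, q2 = a * q1 + q2, q1
    return triples

triples = charves_coefficients(7, 20)
print(sorted(set(triples)))
print({Bn * Bn - 4 * An * Cn for An, Bn, Cn in triples})
```

For $d = 7$ the discriminant $B_n^2 - 4A_nC_n$ should stay equal to $B^2 - 4AC = 28$ for every $n$, and only a handful of distinct triples should occur among the twenty computed, which is exactly what makes the complete quotients repeat.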

Euler’s and Lagrange’s theorems can be merged in the following formulation: Quadratic irrationals, and only they, are represented by periodical continued fractions.

* We leave these manipulations to the reader. A mathematician 
should be patient and unafraid of long chains of transformations. 



Chapter 5 

Approximation of Real Numbers 


5.1. Approximation by Convergents 

5.1.1. High-Quality Approximation. The tiring march has 
finally brought us to the goal of our journey. This chapter 
will disclose what purpose is served by continued fractions. 

In Section 1.1.1 we have interpreted the problem of approximation in a very broad sense. Now we switch to a more specific problem. Take the set $R$ of real numbers*, and single out in $R$ the subset $M_q$ of all fractions with denominators not greater than $q$. The problem is to find for each number $\alpha \in R$ the number $r \in M_q$ closest to it.

Assume now that we were able to find such a number, that is, we found an approximation $\alpha \approx r$. The utility of this approximation consists in that the accuracy cannot be improved without increasing the denominator: indeed, $r$ is the number in $M_q$ closest to $\alpha$.

Note that if we chose to take the set of fractions with denominators exactly equal to $q$, this approximation would not, in general, have high quality in the sense outlined above. For example, we see from Table 1 of Subsection 1.1.4 that the approximation of $\pi$ in units of 1/10 has low quality in comparison with larger fractional units, namely, 1/9, 1/8, 1/7, and 1/6.

The concept of “quality” does not have a single sharply 
defined meaning in approximation theory, so that we have to 
specify each time in what sense this term is being used. 

* It is sufficient to consider only the set of positive real numbers because nothing fundamentally new arises from adding negative numbers: if $\pi \approx \frac{22}{7}$, then $-\pi \approx -\frac{22}{7}$.

5.1.2. The Main Property of Convergents. We can define the best rational approximation of a number $\alpha$ as a fraction $\frac{p}{q}$ which provides a lower absolute error than any other fraction with a smaller denominator (obviously, in this sense, the best approximation is not unique). With this definition, convergents provide the best approximations for a continued fraction. Sometimes this property is said to constitute the main property of convergents. Let us give it the following formulation:

Theorem. If $\frac{p_n}{q_n}$ ($n \geq 1$) is a convergent for a number $\alpha$, and $\frac{p}{q}$ is any other fraction with $q \leq q_n$, then

$$\left|\alpha - \frac{p_n}{q_n}\right| < \left|\alpha - \frac{p}{q}\right|.$$

This means that the convergent gives an approximation that cannot be improved without increasing the denominator.


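[Fig. 12. The points $A$, $\frac{p_n}{q_n}$, $\alpha$, $\frac{p_{n+1}}{q_{n+1}}$, $B$ on the number line; the segment $\Delta_n$ lies between the two convergents.]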


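► Consider two cases: (1) $q < q_n$ and (2) $q = q_n$ (we shall see that the second case is trivial).

(1) The number $\alpha$ belongs to a segment between two convergents $\frac{p_n}{q_n}$ and $\frac{p_{n+1}}{q_{n+1}}$ (Fig. 12). The length of this segment is $|\Delta_n| = \frac{1}{q_n q_{n+1}}$. The point $\alpha$ may either be an internal point of this segment or coincide with $\frac{p_{n+1}}{q_{n+1}}$ (if $\alpha$ is a rational and $\frac{p_{n+1}}{q_{n+1}}$ is the last convergent). Therefore,

$$\left|\alpha - \frac{p_n}{q_n}\right| \leq |\Delta_n|.$$

Let $\frac{p}{q}$ be an arbitrary fraction whose denominator is less than $q_n$, and hence, certainly less than $q_{n+1}$:

$$q < q_n < q_{n+1}.$$

Let us evaluate the distance from $\frac{p}{q}$ to the ends of the segment $\left[\frac{p_n}{q_n}, \frac{p_{n+1}}{q_{n+1}}\right]$:

$$\left|\frac{p}{q} - \frac{p_n}{q_n}\right| = \frac{|pq_n - p_nq|}{qq_n} \geq \frac{1}{qq_n}; \qquad \left|\frac{p}{q} - \frac{p_{n+1}}{q_{n+1}}\right| = \frac{|pq_{n+1} - p_{n+1}q|}{qq_{n+1}} \geq \frac{1}{qq_{n+1}}.$$

These inequalities are only strengthened if $q$ in the right-hand sides is replaced by $q_{n+1}$ and $q_n$, respectively:

$$\left|\frac{p}{q} - \frac{p_n}{q_n}\right| > \frac{1}{q_{n+1}q_n}; \qquad \left|\frac{p}{q} - \frac{p_{n+1}}{q_{n+1}}\right| > \frac{1}{q_nq_{n+1}}. \qquad (35)$$

Inequalities (35) signify that each end of the segment $\left[\frac{p_n}{q_n}, \frac{p_{n+1}}{q_{n+1}}\right]$ and the point $\frac{p}{q}$ are separated by a distance greater than the length of this segment, $|\Delta_n|$. Marking off the segment $|\Delta_n|$ to the left and to the right of points $\frac{p_n}{q_n}$ and $\frac{p_{n+1}}{q_{n+1}}$ (Fig. 12) we obtain the forbidden zone $[AB] = \left[\frac{p_n}{q_n} - |\Delta_n|,\ \frac{p_{n+1}}{q_{n+1}} + |\Delta_n|\right]$ in which the fraction $\frac{p}{q}$ cannot fall (points $A$ and $B$ are also forbidden). Now it is clear that $\frac{p}{q}$ is a poorer approximation for $\alpha$ than $\frac{p_n}{q_n}$. Indeed,

$$\left|\alpha - \frac{p}{q}\right| > |\Delta_n|.$$

Hence

$$\left|\alpha - \frac{p_n}{q_n}\right| < \left|\alpha - \frac{p}{q}\right| \qquad (q < q_n).$$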


(2) Now we shall analyse the case of $q = q_n$. Is it possible for another fraction with the same denominator to provide a better or an equally good approximation than the convergent? In other words, can it happen that

$$\left|\alpha - \frac{p}{q_n}\right| \leq \left|\alpha - \frac{p_n}{q_n}\right| \qquad (p \neq p_n)?$$

For the sake of definiteness we assume that $\frac{p_n}{q_n}$ lies to the left of $\alpha$ (Fig. 13), that is, $n$ is even (arguments are quite similar if $n$ is odd). Can $\alpha$ be closer to $\frac{p_n + 1}{q_n}$ than to $\frac{p_n}{q_n}$, or at least lie in the middle, that is, is it possible that

$$\alpha - \frac{p_n}{q_n} \geq \frac{p_n + 1}{q_n} - \alpha?$$

[Fig. 13. The points $\frac{p_n}{q_n}$, $\alpha$ and $\frac{p_n + 1}{q_n}$ on the number line.]

This is equivalent to

$$\alpha - \frac{p_n}{q_n} \geq \frac{1}{2q_n}. \qquad (36)$$

On the other hand, we know that

$$\alpha - \frac{p_n}{q_n} \leq \frac{1}{q_nq_{n+1}}. \qquad (37)$$

As follows from inequalities (36) and (37),

$$\frac{1}{2q_n} \leq \frac{1}{q_nq_{n+1}},$$

that is, $q_{n+1} \leq 2$.

It means that inequality (36) is only possible if $q_{n+1} = 1$ or $q_{n+1} = 2$. This situation occurs only for $n = 0$ and $n = 1$.

The following example shows that inequality (36) can indeed hold under these conditions:

$$[2; 2] = 2 + \frac{1}{2} = \frac{5}{2}.$$

In this example $\frac{p_0}{q_0} = \frac{2}{1}$. The fraction $\frac{3}{1}$, although it is not a convergent, provides an approximation that is just as good. No such example can be given for $n = 1$ because the last term of a continued fraction cannot be a unity. ■

Note that the converse theorem does not hold, that is, the 
property proved above is not exclusive to convergents. There 
exist fractions that are not convergents but nevertheless give 
a better approximation of a number a than any fraction with 
a smaller denominator. For example, in Subsection 3.1.1 we have listed the convergents of $\frac{61}{27} = [2; 3, 1, 6]$. The reader can verify* that the approximations

$$\frac{61}{27} \approx \frac{34}{15} \quad \text{and} \quad \frac{61}{27} \approx \frac{43}{19}$$

are the best. The absolute errors for them,

$$\left|\frac{61}{27} - \frac{34}{15}\right| \approx 0.007, \qquad \left|\frac{61}{27} - \frac{43}{19}\right| \approx 0.004,$$

are less than those for any fraction with a smaller denominator, although these two fractions are not convergents. The theorem as proved does not mean, therefore, that convergents alone provide the best approximation of real numbers.
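The claim about $\frac{34}{15}$ and $\frac{43}{19}$ can be verified by exhaustive search in exact rational arithmetic. The Python sketch below is only an illustration; the helper names are hypothetical. For each candidate $\frac{p}{q}$ it compares the absolute error with the smallest error attainable for every denominator less than $q$.

```python
import math
from fractions import Fraction

def best_error_with_denominator(alpha, q):
    """Smallest |alpha - p/q| over all integers p, for a fixed denominator q."""
    p = math.floor(alpha * q)
    return min(abs(alpha - Fraction(p, q)), abs(alpha - Fraction(p + 1, q)))

def beats_all_smaller_denominators(alpha, p, q):
    """True if p/q is closer to alpha than every fraction with denominator < q."""
    err = abs(alpha - Fraction(p, q))
    return all(err < best_error_with_denominator(alpha, s) for s in range(1, q))

alpha = Fraction(61, 27)
for p, q in [(7, 3), (9, 4), (34, 15), (43, 19), (16, 7)]:
    err = float(abs(alpha - Fraction(p, q)))
    print(f"{p}/{q}: error {err:.4f}, "
          f"better than all smaller denominators: "
          f"{beats_all_smaller_denominators(alpha, p, q)}")
```

The convergents $\frac{7}{3}$, $\frac{9}{4}$ and the non-convergents $\frac{34}{15}$, $\frac{43}{19}$ should all pass the test, whereas a fraction such as $\frac{16}{7}$ (added here purely for contrast) should fail it.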

5.1.3. Convergents Have the Highest Quality. If the quality of approximation is evaluated in a different sense, as described in Subsection 1.1.4, then convergents have no competitors. The point is that it would be unfair to evaluate the quality of an approximation by the quantity $\left|\alpha - \frac{p}{q}\right|$ independently of the magnitude of the fractional unit. Higher accuracy can be demanded from smaller fractional units (i.e. from larger $q$). Consequently, it is desirable to estimate the absolute error $\left|\alpha - \frac{p}{q}\right|$ on a $q$-dependent scale, for example, multiplying by $q$. The result is the normalized absolute error [see formula (1)]

$$h = |q\alpha - p|$$

or the quality factor [see formula (2)]

$$\gamma = \frac{1}{2h} = \frac{1}{2|q\alpha - p|}.$$

Judging by these characteristics it is the convergents, and nothing but them, that have the highest quality: the normalized absolute error of a convergent is smaller (and hence, the quality factor is greater) than in all other fractions with smaller (or identical) denominators.

But this is not yet the end. Convergents are found to have a quality higher than not only that of fractions with smaller or equal denominators but even of fractions with the nearest greater denominators: quality will not be increased by increasing the denominator until we come to the denominator of the next convergent.

* See footnote on p. 71.

These arguments hide two trivial exceptions that will sur¬ 
face in the course of analysis. 

Now we shall summarize the above reasoning into two theorems converse to each other.

Theorem 1. If $\frac{p_n}{q_n}$ is a convergent for a number $\alpha$ and $\frac{p}{q}$ is an arbitrary fraction with $q < q_{n+1}$, then

$$|q_n\alpha - p_n| \leq |q\alpha - p|.$$

The equality sign occurs only if: (1) $\alpha = \frac{p_{n+1}}{q_{n+1}}$, that is, $\frac{p_n}{q_n}$ is the last-but-one convergent, and (2) $n = 0$, $\alpha = [a_0; 2]$.

► Note that $\frac{p}{q}$ is a different fraction, that is, we eliminate the case of no interest, $\frac{p}{q} = \frac{p_n}{q_n}$. Hereafter we assume that the fraction $\frac{p}{q}$ is irreducible to lower terms.

We shall consider separately two cases: (1) $0 < q < q_{n+1}$, $q \neq q_n$, and (2) $q = q_n$.

(1) Let us represent $p$ and $q$ by identical linear combinations (with identical coefficients) of the corresponding terms of the convergents $\frac{p_n}{q_n}$ and $\frac{p_{n+1}}{q_{n+1}}$, that is,

$$q_nx + q_{n+1}y = q; \qquad p_nx + p_{n+1}y = p. \qquad (38)$$

This system yields the coefficients $x$ and $y$.

The determinant of (38), $p_{n+1}q_n - p_nq_{n+1}$, is recognizable from formula (12):

$$D_n = p_{n+1}q_n - p_nq_{n+1} = (-1)^n.$$

System (38) determines the pair of numbers $x$ and $y$ uniquely because $D_n \neq 0$. Besides, $|D_n| = 1$, and we conclude that $x$ and $y$ are integers.

Both $x$ and $y$ are nonzero. Indeed, if $x = 0$, then system (38) gives $y = 1$ (because both fractions $\frac{p}{q}$ and $\frac{p_{n+1}}{q_{n+1}}$ are irreducible to lower terms) and $q = q_{n+1}$, in contradiction with the imposed condition. And if $y = 0$, we likewise obtain $q = q_n$, which is the case to be analysed later.

The coefficients $x$ and $y$ cannot be of like signs. If $x > 0$ and $y > 0$, then the first equation in (38) would give $q > q_{n+1}$. If $x < 0$ and $y < 0$, then $p$ and $q$ would be negative. Hence, $x$ and $y$ are of unlike signs.


In order to find the normalized absolute error for the fraction $\frac{p}{q}$, we begin with multiplying the first equation by $\alpha$ and subtracting from it the second equation:

$$(q_n\alpha - p_n)x + (q_{n+1}\alpha - p_{n+1})y = q\alpha - p. \qquad (39)$$

The differences in the parentheses in the left-hand side of equation (39) are of unlike signs (because the convergents $\frac{p_n}{q_n}$ and $\frac{p_{n+1}}{q_{n+1}}$ approximate $\alpha$ on the opposite sides). The numbers $x$ and $y$ are also of unlike signs. Both terms in the left-hand side therefore have the same sign (more precisely, the first is nonzero and the second may vanish). Hence,

$$|q_n\alpha - p_n| \cdot |x| + |q_{n+1}\alpha - p_{n+1}| \cdot |y| = |q\alpha - p|.$$

Therefore,

$$|q_n\alpha - p_n| \leq |q\alpha - p|, \qquad (40)$$

which was to be proved.

Now we shall determine the conditions corresponding to the equality sign in (40). As follows from the analysis above, it is possible only if

$$q_{n+1}\alpha - p_{n+1} = 0; \qquad |x| = 1. \qquad (41)$$

Let us analyse the case (41) in more detail. If $x = 1$, we shall have $y < 0$. But then the first equation in (38) would yield $q < 0$. Hence, $x \neq 1$ and therefore, $x = -1$. This entails $y = 1$. Indeed, if we assume $y > 1$, then the first equation in (38) can be rewritten as follows:

$$-q_n + q_{n+1} + q_{n+1}(y - 1) = q,$$

which implies $q > q_{n+1}$.

The mandatory situation in the case (41) is thus $x = -1$, $y = 1$, that is,

$$q = q_{n+1} - q_n; \qquad p = p_{n+1} - p_n. \qquad (42)$$

This entails the equality sign in (40).

Note that the first condition in (42) can be transformed to

$$q = a_{n+1}q_n + q_{n-1} - q_n = (a_{n+1} - 1)q_n + q_{n-1}.$$

In the case under consideration $a_{n+1}$ is the last term of the continued fraction, and hence, $a_{n+1} \geq 2$. The last equality therefore implies

$$q \geq q_n.$$

Consequently, the equality in (40) cannot occur when $q < q_n$.

(2) Now we shall consider the case of $q = q_n$. We already know from Subsection 5.1.2 that in this case, for $p \neq p_n$,

$$\left|\alpha - \frac{p_n}{q_n}\right| < \left|\alpha - \frac{p}{q_n}\right|.$$

By multiplying both sides of this inequality by $q_n$ we obtain

$$|q_n\alpha - p_n| < |q_n\alpha - p|,$$

which completes the proof (of course, the exceptional case, possible when $n = 0$, is retained here as well). The theorem has thus been proved for all $q < q_{n+1}$. ■
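Theorem 1, together with its equality cases, can be tested by brute force for a rational $\alpha$. The Python sketch below is an illustration in exact arithmetic; the function names are hypothetical. For every convergent $\frac{p_n}{q_n}$ and every denominator $q < q_{n+1}$ it checks inequality (40), trying only the two integers $p$ nearest to $q\alpha$ (any other $p$ gives a larger error), and it collects the cases where equality occurs.

```python
import math
from fractions import Fraction

def convergents(alpha):
    """Convergents (p_n, q_n) of a positive rational alpha, via Euclid's algorithm."""
    num, den = alpha.numerator, alpha.denominator
    p2, q2, p1, q1 = 0, 1, 1, 0          # (p_{-2}, q_{-2}) and (p_{-1}, q_{-1})
    result = []
    while den:
        a, rem = divmod(num, den)
        p2, q2, p1, q1 = p1, q1, a * p1 + p2, a * q1 + q2
        result.append((p1, q1))
        num, den = den, rem
    return result

def check_theorem_1(alpha):
    """Check (40) for all q < q_{n+1}; return the equality cases as (n, p, q)."""
    conv = convergents(alpha)
    equalities = []
    for n in range(len(conv) - 1):
        (pn, qn), (_, q_next) = conv[n], conv[n + 1]
        h_n = abs(qn * alpha - pn)
        for q in range(1, q_next):
            base = math.floor(q * alpha)
            for p in (base, base + 1):
                if Fraction(p, q) == Fraction(pn, qn):
                    continue             # the same fraction is excluded by the theorem
                h = abs(q * alpha - p)
                assert h_n <= h, (n, p, q)
                if h == h_n:
                    equalities.append((n, p, q))
    return equalities

print(check_theorem_1(Fraction(61, 27)))
print(check_theorem_1(Fraction(5, 2)))
```

For $\alpha = \frac{61}{27}$ the only equality case found should be $\frac{p}{q} = \frac{52}{23}$ with $n = 2$, which is the situation (42); for $\alpha = [2; 2] = \frac{5}{2}$ it should be $\frac{3}{1}$ with $n = 0$.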

Theorem 2 (converse). If the normalized absolute error for a number $\alpha$ and a fraction $\frac{p}{q}$ is less than that for any other fraction $\frac{p'}{q'}$ with $q' \leq q$, then $\frac{p}{q}$ is a convergent for $\alpha$.

► We assume, as always, that the fraction $\frac{p}{q}$ is irreducible to lower terms. Besides, if $\alpha$ is a rational, $\alpha = \frac{p_{n+1}}{q_{n+1}}$, then $q$ cannot be greater than $q_{n+1}$ because the normalized absolute error for the fraction $\frac{p_{n+1}}{q_{n+1}}$ is zero, while according to the initial condition it should be greater than $|q\alpha - p|$.

Assume that $\frac{p}{q}$ is not a convergent. Then its denominator lies somewhere between the denominators of two neighbouring convergents, that is,

$$q_n < q < q_{n+1}.$$

The direct theorem then gives

$$|q_n\alpha - p_n| \leq |q\alpha - p|.$$

This contradicts the condition stated in the theorem: since $q_n < q$, then $\frac{p}{q}$ must give a smaller normalized absolute error than $\frac{p_n}{q_n}$. The assumption that $\frac{p}{q}$ is not a convergent is therefore false. ■


Note 1. We have proved that convergents, and only convergents, provide a smaller normalized absolute error and hence, a greater quality factor, than all other fractions with smaller denominators.

Why “with smaller denominators” only? Could it be true for fractions with slightly greater denominators?

No, it could not. Only the direct theorem holds for denominators $q$ in the interval $q_n < q < q_{n+1}$, and it cannot be converted.

Note 2. Let us consider in more detail the case (41):

$$\alpha = \frac{p_{n+1}}{q_{n+1}}, \qquad p = p_{n+1} - p_n, \qquad q = q_{n+1} - q_n.$$

We shall directly demonstrate that although the fraction

$$\frac{p}{q} = \frac{p_{n+1} - p_n}{q_{n+1} - q_n}$$

is not a convergent, and although $q_n < q < q_{n+1}$, this fraction nevertheless has the same quality as the convergent $\frac{p_n}{q_n}$.

The manipulations that follow need not be explained:

$$|q\alpha - p| = \left|(q_{n+1} - q_n)\frac{p_{n+1}}{q_{n+1}} - p_{n+1} + p_n\right| = \left|\frac{p_nq_{n+1} - q_np_{n+1}}{q_{n+1}}\right| = \frac{1}{q_{n+1}};$$

$$|q_n\alpha - p_n| = \left|q_n\frac{p_{n+1}}{q_{n+1}} - p_n\right| = \left|\frac{q_np_{n+1} - p_nq_{n+1}}{q_{n+1}}\right| = \frac{1}{q_{n+1}},$$

that is, $|q\alpha - p| = |q_n\alpha - p_n|$. Recall that this would be impossible in the case $q < q_n$: it implies $|q\alpha - p| > |q_n\alpha - p_n|$.

For example, the consecutive convergents for the fraction $\alpha = \frac{61}{27}$ are (see Subsection 3.1.1)

$$\frac{p_0}{q_0} = \frac{2}{1}; \quad \frac{p_1}{q_1} = \frac{7}{3}; \quad \frac{p_2}{q_2} = \frac{9}{4}; \quad \frac{p_3}{q_3} = \frac{61}{27}.$$

The fraction $\frac{p}{q} = \frac{p_3 - p_2}{q_3 - q_2} = \frac{52}{23}$ has the same quality as $\frac{9}{4}$ despite the fact that $4 < 23 < 27$.

Note 3. Let us compare the approximations of the fraction $\alpha = \frac{61}{27}$ by fractions with denominators 1, 2, 3, 4 (Table 2).

Table 2

$q$    Approximate value of $\alpha$    Normalized absolute error $h$    Quality factor $\gamma$
1      $\frac{2}{1}$                    $\frac{7}{27}$                   $\frac{27}{14} = 1\frac{13}{14}$
2      $\frac{5}{2}$                    $\frac{13}{27}$                  $\frac{27}{26} = 1\frac{1}{26}$
3      $\frac{7}{3}$                    $\frac{2}{9}$                    $\frac{9}{4} = 2\frac{1}{4}$
4      $\frac{9}{4}$                    $\frac{1}{27}$                   $\frac{27}{2} = 13\frac{1}{2}$

By looking at this table we can determine, without expanding the number $\frac{61}{27}$ into a continued fraction, that $\frac{9}{4}$ is its convergent: the quality factor of $\frac{9}{4}$ is greater than all preceding factors. The same is true for the fraction $\frac{7}{3}$. However, $\frac{5}{2}$ is not a convergent because its quality factor is less than that of $\frac{2}{1}$.
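The entries of such a table are easy to reproduce in exact arithmetic. The Python sketch below is an illustration (the helper `table_row` is hypothetical): for each denominator $q$ it picks the numerator $p$ nearest to $q\alpha$ and evaluates the normalized absolute error $h = |q\alpha - p|$ and the quality factor $\frac{1}{2h}$.

```python
import math
from fractions import Fraction

alpha = Fraction(61, 27)

def table_row(alpha, q):
    """Nearest numerator p for denominator q, with h = |q*alpha - p| and 1/(2h)."""
    base = math.floor(q * alpha)
    p = min((base, base + 1), key=lambda c: abs(q * alpha - c))
    h = abs(q * alpha - p)
    return p, h, 1 / (2 * h)

rows = {q: table_row(alpha, q) for q in range(1, 5)}
for q, (p, h, quality) in rows.items():
    print(f"q = {q}: p/q = {p}/{q}, h = {h}, quality factor = {quality}")
```

A denominator whose quality factor exceeds all the preceding ones marks a convergent, so extending the range of $q$ gives a slow but theory-free way of listing them.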
Chapter 6

Solutions 


6.1. The Mystery of Archimedes’ Number 

6.1.1. The Key to All Puzzles. The readers who have worked 
their way through Chapters 1, 2, 3, 4 and 5 will now be re¬ 
warded. We are ready to explain the puzzles of Chapter 1. 

In fact, this booklet has been written for the sake of one 
short conclusion: If you want to approximate a real number 
with high accuracy by a fraction with sufficiently simple de¬ 
nominator, replace it with convergents. 

Thus we solve both Archimedes’ problem and the problem 
of the calendar. 

Note that Christian Huygens came to continued fractions 
when trying to approximate real numbers by sufficiently sim¬ 
ple fractions. He needed to construct a model of the Solar 
system in which planets were modelled by gear-wheels. To 
reproduce revolution periods with sufficiently high accuracy 
wheels had to have staggeringly high numbers of teeth. Huy¬ 
gens looked for, and found, a general method of solving such 
problems: the substitution of much smaller numbers for the 
large ones, reproducing their ratios as accurately as possible. 
In this way he invented continued fractions as an auxiliary 
tool, and discovered many of their properties although Raffael 
Bombelli in Italy had operated with these fractions (in a more 
superficial way) a hundred years earlier. 

N. N. Luzin used to say in such cases that “even chips and 
shavings are valuable in the laboratory of a great scien¬ 
tist.” 

6.1.2. The Secret of Archimedes’ Number. To find approximations of the number $\pi$ we expand it into a continued fraction. We can take its decimal approximation with high accuracy, for example, $3.14159265 = \frac{314\,159\,265}{100\,000\,000}$, and apply Euclid’s algorithm:

$$\pi = [3; 7, 15, 1, 288, 1, \ldots].$$

Now we calculate convergents by the method of Subsection 3.1.4:

$n$      0    1     2      3      4
$a_n$    3    7     15     1      288
$p_n$    3    22    333    355    102 573
$q_n$    1    7     106    113    32 650

And this is all, as simple as that. This table exposes Archimedes’ secret, as well as that of Metius. The table demonstrates:

Convergent $\frac{p_n}{q_n}$    Approximation
0th                             3 (by defect)
1st                             $\frac{22}{7}$ (by excess)
2nd                             $\frac{333}{106}$ (by defect)
3rd                             $\frac{355}{113}$ (by excess)

Can it really be said that Archimedes and Metius are at last “exposed”: they had employed continued fractions, with Archimedes using the convergent $\frac{p_1}{q_1}$ and Metius the convergent $\frac{p_3}{q_3}$?

No, it cannot, at least not about Archimedes.

It should be clear to the reader that the problem we have solved is one of mathematics, not of history. We have demonstrated how one could have come to the approximation of $\pi$ by a fraction $\frac{22}{7}$, but this does not mean that Archimedes did use this approach. In fact, it cannot be ruled out that he had used the continued fractions algorithm. This conjecture is supported by two arguments: (1) this is the most natural approach when decimals have not yet been invented and (2) the ancients preferred fractions with unity for the numerator. Only such fractions were in use in Egypt and Babylon, and other fractions were gaining recognition only very slowly. Nevertheless, these are merely speculative arguments, and would be rejected at any court of law. No direct evidence has been found. In order to evaluate $\pi$ Archimedes calculated the perimeters of inscribed and circumscribed regular polygons, using the “duplication formula”. We do not know how Archimedes had extracted roots, for he only gave the final result. Historians were unable to come to a universally acceptable conclusion on this subject.

The advantages of fractions with denominator 7 can be discovered empirically, by comparing them with fractions having different denominators. But Metius (or rather, Antoniszoon) could not act this way. It would hardly be possible to find the complicated fraction $\frac{355}{113}$ without a theory. There is virtually no doubt that Antoniszoon resorted to continued fractions. It is perfectly clear why he stopped with the convergent $\frac{355}{113}$. In fact, this is the last acceptable fraction. The next one, $\frac{102\,573}{32\,650}$, is so cumbersome that it cannot have any practical significance.


6.2. The Solution to the Calendar Problem 

6.2.1. The Use of Continued Fractions. Let us think first 
how we ourselves would solve the problem of alternation of 
ordinary and leap years. We would represent the duration of 
the year by a continued fraction 

1 year = 365 days 5 hours 48 minutes 46 seconds 
= [365; 4, 7, 1, 3, 5, 64] days.

Note 1. The number $\pi$ is an irrational and is represented
by a nonterminating continued fraction. The length of the
year is an empirical quantity. All empirical quantities are 
measured with errors, so it would be meaningless to consider 
them as rational or irrational. The length of the year as given 
above is the adopted value and we have to treat it as exact. 
It is given by a terminating continued fraction. 
Note 2. We need not express the length of the year by a decimal in fractions of one day (by analogy with what we did in the case of $\pi$) if we wish to represent this duration by a continued fraction. The calculations are carried out as follows (we have dropped the integral component):

$$\frac{5\ \text{h}\ 48\ \text{m}\ 46\ \text{s}}{1\ \text{day}} = \frac{20\,926\ \text{s}}{86\,400\ \text{s}} = \frac{10\,463}{43\,200};$$

43 200 = 4 · 10 463 + 1348;

10 463 = 7 · 1348 + 1027;

1348 = 1 · 1027 + 321;

1027 = 3 · 321 + 64;

321 = 5 · 64 + 1;

64 = 64 · 1.


Let us find several convergents of the continued fraction 
representing the length of the year. The integral part can be 
omitted because we need only remember that each year contains 
365 full days: 


    4      7      1      3      5

    1      7      8     31    163

    4     29     33    128    673


Each column gives a solution to the calendar problem. For 
example, the first column gives the year an approximate 
length of $365\frac{1}{4}$ days. This duration is achieved by making 
one year in four a leap year. In general, the third row gives 
the length of the cycle (or period), and the second row gives 
the number of leap years per cycle. For example, the second 
column prescribes the following solution: seven leap years in 
a 29-year cycle. This corresponds to an average duration 
of the year of $365\frac{7}{29}$ days. This is a more accurate pattern 
than $365\frac{1}{4}$, but a more complicated one. 
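The second and third rows of the table follow from the first by the usual recurrence for convergents, $p_k = a_k p_{k-1} + p_{k-2}$ and $q_k = a_k q_{k-1} + q_{k-2}$. A minimal sketch in Python (the helper name `convergents` is our own) reproduces them:

```python
from fractions import Fraction

def convergents(quotients):
    """Yield the convergents p_k/q_k of [0; a_1, a_2, ...] by the standard recurrence."""
    p_prev, q_prev = 1, 0     # p_{-1}, q_{-1}
    p, q = 0, 1               # p_0/q_0 = 0/1, since the integral part is dropped
    for a in quotients:
        p, p_prev = a * p + p_prev, p
        q, q_prev = a * q + q_prev, q
        yield Fraction(p, q)

for c in convergents([4, 7, 1, 3, 5]):
    print(f"{c.numerator} leap years per {c.denominator}-year cycle")
```

Each printed line corresponds to one column of the table, read from left to right.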

6.2.2. How to Choose a Calendar. Now it is clear that we 
are offered only four options. 

In order to avoid misunderstandings, we have to remark 
that a very large number of calendars exist in the world. 
There exist solar and lunar calendars. Different peoples use 
different starting points for counting off the years, different 
numbers of months per year (12 or 13), different (and tremendously 
diverse) starting dates for the year, and different celebration 
dates. In the present booklet we do not cover the whole variety of 
these distinct features, and single out one aspect, namely, the 
average length of the year. There are only four acceptably simple 
and exact options. They are given by the first four columns of the 
table above. Combinations stemming from the fifth and subsequent 
columns are far too complicated. The possible, and acceptable, 
options are thus listed in Table 3. 


Table 3 

            Alternation of leap years
Option      number of                   Average length          Error
No.         leap years     period       of a year

  1              1            4         365 d 6 h 00 m 00 s     −11 m 14 s
  2              7           29         365 d 5 h 47 m 35 s     +1 m 11 s
  3              8           33         365 d 5 h 49 m 05 s     −19 s
  4             31          128         365 d 5 h 48 m 45 s     +1 s


The minus sign in the Error column indicates that the 
average duration of the year is greater than the true value. 
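The Error column can be recomputed from the adopted year length of 365 d 5 h 48 m 46 s. The sketch below is in Python; the names `TRUE_EXCESS_S` and `error_seconds` are ours, and the sign convention is the one just stated (adopted value minus calendar average).

```python
from fractions import Fraction

TRUE_EXCESS_S = 5 * 3600 + 48 * 60 + 46    # 20926 s beyond 365 full days

def error_seconds(leap_years, period):
    """Adopted year length minus the calendar's average year, in seconds."""
    average_excess = Fraction(leap_years * 86400, period)
    return TRUE_EXCESS_S - average_excess

for leap_years, period in [(1, 4), (7, 29), (8, 33), (31, 128)]:
    # Rounded to whole seconds, as in Table 3.
    print(leap_years, period, round(error_seconds(leap_years, period)))
```

Rounded to whole seconds the four errors are −674, +71, −19 and +1, that is −11 m 14 s, +1 m 11 s, −19 s and +1 s.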

The first option is the Julian calendar. 

The second option is not expedient. It is as complicated as 
the third option while being much less accurate. 

The third option (8 leap years in a 33-year cycle) was 
proposed by the great Persian and Tajik scholar and poet Omar 
Khayyam in 1079. 

The fourth option is exceptionally accurate. The error of 1 s 
is of no practical significance. This calendar has indeed 
been proposed; for example, the Russian astronomer Medler 
suggested in 1864 that it be introduced in Russia from the beginning 
of the 20th century. It called for only one correction to the 
Julian calendar: to skip one leap year every 128 years 
(i.e. to treat this year as an ordinary one). Indeed, the Julian 
calendar contains 32 leap years per 128-year cycle. 

However, this calendar has been enacted neither in Russia 
nor anywhere else. The likely reasons are that the 128-year 
period is not “rounded-off”, and that people are strongly 
accustomed to the existing calendar. 
6.2.3. The Secret of Pope Gregory XIII. The preceding 
Subsection did not solve the mystery of Pope Gregory XIII: 
the Gregorian calendar is not found among the four options 
in the table. For this reason, having solved the mathematical 
problem, we shall spend some time on the historical problem. 
What were the arguments behind the decision of Pope 
Gregory XIII (or rather, of the commission he had designated)? 

Consider a very appealing hypothesis: Pope Gregory XIII 
leaned towards the ratio 31:128 but wanted to replace the period 
of 128 years with something more convenient, and chose 
400 years for this. If 128 years contain 31 leap years, how many 
are contained in 400 years? The proportion 

\[
\frac{31}{128} = \frac{x}{400}
\]

yields $x = 96.875 \approx 97$. This is precisely the Gregorian 
calendar: 97 leap years per 400-year cycle. 

Quite conclusive, isn’t it? Unfortunately, it is wrong. 

When arguing out a historical case, including an event in 
the history of science, we must avoid ascribing our modern way 
of reasoning to the scholars of yore. Quite the opposite; we ought 
to try and penetrate their world of ideas and knowledge. 
Furthermore, speculative arguments of the 
it-could-quite-likely-have-been-like-this type are not popular with 
historians. We need to turn up documents and ascertain that “it has 
been thus and not differently”. A good deal is known about the 
Gregorian calendar reform, including the scholars who sat on 
the commission given the job of drawing up the project of the 
reform. 

The flaw in our speculative reasoning was this: the duration 
of the year was not known in the time of Pope Gregory XIII 
as accurately as we know it nowadays. Gregory XIII’s Commission 
used the astronomical tables compiled by the Academy 
of Toledo on the order of King Alphonse X (the Wise) of 
Castile (1221-1284). The length of the year in these tables was 

1 year = 365 days 5 hours 49 minutes 16 seconds. 

Converted to a continued fraction, it gives 

1 year = [365; 4, 8, 7, 2, 2, 17]. 

Its convergents (with the integral part omitted) are 

\[
\frac{1}{4},\ \frac{8}{33},\ \frac{57}{235},\ \ldots
\]
Pope Gregory XIII’s Commission thus could not have been aware 
of the ratio $\frac{31}{128}$, whatever method it chose. 
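The expansion of the Alfonsine value can be verified in the same way as that of the modern one. The following Python sketch (the function `expand` is a hypothetical helper of ours) computes the partial quotients and the convergents together and confirms that 31/128 is not among them.

```python
def expand(num, den):
    """Return (partial quotients, convergents) of num/den, assuming 0 < num < den."""
    quotients, convs = [], []
    p_prev, p, q_prev, q = 1, 0, 0, 1
    num, den = den, num               # skip the zero integral part
    while den:
        a, r = divmod(num, den)
        quotients.append(a)
        p, p_prev = a * p + p_prev, p
        q, q_prev = a * q + q_prev, q
        convs.append((p, q))
        num, den = den, r
    return quotients, convs

alfonsine = 5 * 3600 + 49 * 60 + 16   # 5 h 49 m 16 s = 20956 s
quotients, convs = expand(alfonsine, 86400)
print(quotients)                      # [4, 8, 7, 2, 2, 17]
print(convs[:3])                      # [(1, 4), (8, 33), (57, 235)]
print((31, 128) in convs)             # False
```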

It has already been mentioned that the average duration of 
the year in the Gregorian calendar is 365 days 5 hours 49 minutes 
12 seconds, and thus is 27 seconds longer than the true 
duration. But this is our view, while Pope Gregory XIII 
was of the opinion that his year was 4 seconds shorter than the 
true duration. We thus find that Pope Gregory XIII’s Commission 
could be very much satisfied with the accuracy it had 
achieved. 

We should add that nothing points to the use of continued 
fractions by the papal commission; continued fractions 
remained unknown in Europe at the time. Rather, the commission 
came to its decision by trial and error. Here 
is how this could have been done quite easily. According to 
the Tables of Alphonse X, the Julian year was longer than 
the true year by 10 m 44 s. How many years would it take to 
accumulate an error of one full day? Divide 24 hours by 
10 m 44 s: 

\[
\frac{24\ \text{h}}{10\ \text{m}\ 44\ \text{s}} = \frac{86\,400}{644} \approx 134.
\]

It is thus necessary to skip a leap year once in every 
134 years. But this would not be suitable because the 
134th year may not be a leap year. But $134 \approx \frac{1}{3} \cdot 400$. 
Ergo: skip three leap years during 400 years. This gives the 
Gregorian calendar. 
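The arithmetic of this trial-and-error argument is easy to retrace. The sketch below, in Python, follows the reasoning as reconstructed here and makes no claim about how the commission actually computed; all variable names are ours.

```python
from fractions import Fraction

DAY = 86400                                     # seconds in a day
alfonsine = 5 * 3600 + 49 * 60 + 16             # 20956 s beyond 365 days
julian_excess = Fraction(DAY, 4) - alfonsine    # 644 s = 10 m 44 s per year

years_per_day = Fraction(DAY) / julian_excess   # a little over 134 years
skips_per_400 = round(400 / years_per_day)      # just under 3, rounds to 3

leap_years = 100 - skips_per_400                # 97 leap years in 400 years
gregorian = Fraction(leap_years * DAY, 400)     # 20952 s = 5 h 49 m 12 s
print(float(years_per_day), skips_per_400, alfonsine - gregorian)
```

The last printed value is 4: measured against the Alfonsine tables, the Gregorian year is 4 seconds short, in agreement with the figure quoted above.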